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Methods and Agents for Diagnosis and Prevention, Amelioration or 
Treatment of Goblet Cell-Related Disorders 

5 FIELD OF THE INVENTION 

The present invention inter alia relates to methods for the 
prevention, amelioration or treatment of medical conditions associated with an 
alteration in normal goblet cell function. It also relates to methods of screening for 
disease-relevant markers indicative of an increased risk of a subject of developing 

10 such a condition, as well as to methods of screening for and diagnosis of a 
predisposition in a human subject for such conditions. It furthermore relates to an 
animal model useful for studying said medical conditions and the molecular 
mechanisms underlying it, and uses of that animal model, for example for the 
identification of diagnostic markers or agents useful for the prevention, 

15 amelioration, or treatment of a goblet cell-related disorder. 

Novel agents such as polypeptides and fragments thereof, nucleic acids and 
antibodies which are useful in the above methods, and novel pharmaceutical 
compositions are likewise provided. The invention further relates to screening 
methods for agonists and antagonists useful for performing said methods. These 

20 and further aspects of the invention will be described in more detail below. 

BACKGROUND OF THE INVENTION 

The epithelial mucosal layer is a physical and chemical barrier 
important in protecting the animal body from dryness, harmful exogenous 

25 substances and pathogens. Mucus forms a gel layer covering the epithelial surface, 
acting as a semi-permeable barrier between the epithelium and the exterior 
environment Mucus serves many functions, including protection against shear 
stress and chemical damage, and, especially in the respiratory tree, trapping and 
elimination of particulate matter and microorganisms. The mucus layer on top of 

30 the intestinal epithelium is the barrier between the host's internal milieu and gut 
bacteria. In the vertebrate eye, the inner layer of the tear film consists of mucous 
secretion products. Mucus is a viscous fluid composed primarily of highly 

1 
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glycosylated proteins called mucins suspended in a solution of electrolytes 
(Dekker et al., 2002). Mucins and other components of mucus are secreted from 
the apical surface of specialized columnar epithelial cells referred to as goblet 
cells (Verdugo, 1990). 
5 Goblet cells are distributed among other cells in the epithelium of 

many organs, especially in the intestinal and respiratory tracts. In areas like the 
conjunctiva, their numbers are rather small compared to other cell types, whereas 
in tissues such as the colon, they are much more abundant. Goblet cells have a 
characteristic morphology, based on membrane-bound secretory granules, which 

10 contain mucus (Specian and Oliver, 1991). 

The goblet cells' function is the secretion of mucins and other 
products, including protease resistant peptides - like the trefoil peptide family, 
which protect epithelium from injury and promote repair through restitution of 
epithelial cells (Podolsky, 2000). Secretion of mucus occurs by exocytosis of 

15 secretory granules (Verdugo, 1991). Mucins have the ability to hydrate and form a 
viscous gel, producing a protective scaffold overlaying epithelial surfaces. 

Constitutive or basal secretion occurs at low levels and is 
essentially unregulated and continuous. Stimulated secretion corresponds to 
regulated exocytosis of granules in response to extracellular stimuli such as 

20 hormones, neuropeptides and inflammatory mediators (Jackson, 2001; Laboisse et 
al., 1996). This pathway provides the ability to dramatically increase mucus 
secretion. The lumen of the intestinal tract inevitably contains numerous 
secretagogue irritants like gut bacteria (Deplancke and Gaskins, 2001). In the lung 
irritants such as dust and smoke are potent inducers of goblet cell secretion 

25 (Maestrelli et al., 2001). Besides stimulated exocytosis of stored mucin granules, 
prolonged exposure to secretagogue substances induces mucin gene expression 
and goblet cell hyperplasia (Ahlstedt and Enander, 1987; Maestrelli et al., 2001; 
Nadel, 2001). Epithelial cell differentiation in mucosal tissues has been studied to 
some detail in the gastrointestinal tract endodenn and the bronchial airways 

30 (Nadel, 2001; van Den Brink et al., 2001). In the intestinum, goblet cells 
differentiate from a multipotent stem cell, which gives rise to four epithelial cell 
types: enterocytes, goblet, enteroendocrine and Paneth cells (Yang et al., 2001), 
Recent genetic data provided evidence that the transcription factors Mathl, Klf4 
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and Elf3 as well as the GTPase Racl are required for intestinal goblet cell 
differentiation in mice (Katz et al., 2002; Stappenbeck and Gordon, 2000; Yang et 
al., 2001). In airways, ligands of the epidermal growth factor receptor have been 
proposed to stimulate epithelial cell differentiation and mucin expression (Nadel, 
5 2001). 

As an alternative approach to identify genes involved in epithelial 
function we performed a genome wide screen for mutations influencing epithelial 
functions in mice, e.g. nutrient absorption by intestinal mucosa. Within this 
screen, a variant C3H mouse was identified which suffered from chronic diarrhea 

10 and impaired thriving. This mouse was fertile and the phenotype was transmitted 
to its offspring in a recessive fashion. This novel mouse variant is referred to as 
"MTZ" hereafter. Histological analysis demonstrated that the primary defect 
responsible for the observable phenotype in the novel C3H variant is a defective 
differentiation, particularly terminal differentiation, or function of goblet cells in 

15 its intestinal mucosa. The responsible mutation was identified by positional 
cloning and shown to result in an amino acid exchange within a known gene. This 
gene has been referred to as "anterior gradient 2" (Agr2) by the Mouse Genome 
Informatics database. Expression of the corresponding cDNA was described in 
murine intestinal tissues, specifically in intestinal goblet cells, by in situ 

20 hybridization (Komiya et al., 1999). The human orthologous gene, which encodes 
a protein with 91% amino acid identity, when compared to the mouse Agr2 gene, 
Eas been referred to as "Anterior gradient 2 homolog" (AGR2). 

Human AGR2 (also termed BCMP7 and XAG-1) is a known 
protein. For example, WO 98/07749 discloses human growth factors, including a 

25 sequence identified as huXAG-1, which corresponds to human AGR2 and is 
suggested in that reference to be a growth factor and marker for colon cancer. 

WO 99/53040 discloses a large number of sequences derived from 
an EST database, including sequences (identified as sequences ID 265 and 288), 
which correspond to AGR2. 

30 WO 99/55858 again discloses a large number of sequences derived 

from an EST database, including sequences (identified as sequences ID 8 and 
181), which correspond to AGR2 and are indicated as being more highly 
expressed in pancreas cancer tissue. 
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WO 00/53755 discloses a sequence (PRO 1030), which 
corresponds to AGR2. Using gene copy amplification, it is reported that the 
number of gene copies are increased in primary lung and colon tumor. 

Sequences corresponding to AGR2 are also disclosed in WO 

5 99/40189. 

In US patent application 2002111303, AGR2 (referred to therein as 
BCMP 7) is predicted to be an extracellular protein with an N-terminal signal 
sequence and suggested to be a marker for breast cancer and prostate cancer. 

Human AGR2 mRNA was shown to be expressed in trachea, lung, 

10 stomach, colon, prostate and small intestine (Thompson and Weigel, 1998). 

cDNA sequences relating to human AGR2 are referred to in US 
6,312,922 (SEQ ID NOS:61 and 149). 

The actual function of the AGR2 protein on the cellular level or on 
the level of the organism has not been described in mammals up to now. The only 

15 functional analysis of a protein homologue to AGR2 has been performed in 
Xenopus laevis, published by Aberger et al. (Aberger et aL, 1998). The authors 
demonstrated that overexpression of XAG-2 induces both, ectopic cement gland 
differentiation and expression of anterior neural marker genes in Xenopus 
embryos. However, a Xenopus protein with the highest degree of amino acid 

20 identity, when compared to murine and human AGR2, is the protein CGS 
(EMBL/GenBank/DDBJ databases accession number AAL26844; TrEMBL entry 
Q90Y05), exhibiting 59% amino acid identity to murine Agr2 protein, and 
exhibiting 60% amino acid identity to human AGR2, respectively. The function of 
CGS, the putative AGR2 orthologue in Xenopus laevis, is not described yet. 

25 In a detailed study we analyzed the RNA expression profile of the 

mouse Agr2 gene and the human AGR2 gene. The phenotype observed in the 
mouse model described herein demonstrates for the first time that Agr2 function is 
required for normal goblet cell function in a mammalian model organism. 

Altered mucus production has been implicated in various diseases, 

30 e.g. asthma, chronic obstructive pulmonary disease (COPD), and cystic fibrosis, 
which are characterized by increased mucus production. Diseases like dry eye 
syndrome, gastric disease, peptic ulcer, and inflammatory bowel disease are 
characterized by decreased mucus production. Altered mucus production is also 

4 
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described in malignancies like colorectal cancer (Corfield et al., 2001; Einerhand 
et al., 2002; Fahy, 2001; Forstner, 1978; Jass and Walsh, 2001; Maestrelli et al., 
2001; Melton, 2002; Puchelle et al., 2002; Schreiber et al. s 2002; Slomiany and 
Slomiany, 2002; Velcich et al., 2002; Voynow, 2002; Watanabe, 2002). 
5 Therefore, great efforts are made in biomedical research to understand the 
mechanisms that are involved in epithelial cell differentiation, in the regulation of 
mucus production, in mucus secretion and in the maintenance of intact mucosal 
surfaces. Several strategies of modulating mucus production have been proposed 
(see the following patents and patent applications), e.g. by LTB4 antagonists (WO 
10 02/55065), EGF receptor antagonists (WO 02/05842), polycationic peptides (US 
6,245,320), KGF (WO 94/23032) (Farrell et al., 2002) and KGF-2 (WO 
99/41282). Several scientific reviews have been published recently covering 
epithelial cell differentiation in different tissue types containing mucus producing 
cells (Bhat, 2001; Brittan and Wright, 2002; Daniels et al., 2001; Emura, 2002; 
15 Foster et al., 2002; Otto, 2002). However, there has been no suggestion of an 
involvement of the AGR2 gene or its gene product in mucus production. 

The invention described herein demonstrates for the first time that 
AGR2 is required for normal goblet cell function, in particular mucin secretion. 
The invention therefore opens novel opportunities for the diagnosis and treatment 
of said diseases involving malfunction of mucus producing tissues or any other 
condition, for which modulation of mucus production might have a therapeutic 
effect. 

SUMMARY OF THE INVENTION 

In a first aspect, this invention provides a non-human animal useful 
as a model of goblet cell related disorders in humans, such as asthma, chronic 
obstructive pulmonary disease (COPD), cystic fibrosis, dry eye syndrome, gastric 
disease, peptic ulcer, inflammatory bowel disease, in particular Crohn's disease or 
ulcerative colitis, and malignancies like colorectal cancer. 

In one embodiment, the animal of the invention carries a mutated 
AGR2 gene encoding an AGR2 protein with a modified amino acid sequence 
compared to the wild type sequence. In one embodiment, the AGR2 protein may 
have a modified amino acid sequence that causes a loss of function phenotype. 
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Alternatively, the AGR2 may have a modified amino acid sequence that causes a 
gain of function phenotype. 

The present invention also relates to methods using the animal 
model of the invention for the study of disorders associated with mutations in 

5 AGR2. In one embodiment, the invention provides methods of diagnosis for 
deficiencies or overproduction in AGR2, or the gene encoding it In another 
embodiment, the invention provides a method for screening of preventive or 
therapeutic agents of disorders and symptoms associated to AGR2 mutations, 
using the animal model of the invention. 

10 Furthermore, the present invention provides mutated AGR2 nucleic 

acids and polypeptides (also referred to as "muteins") having modified sequences 
compared to the wild type sequences. These mutated nucleic acids and 
polypeptides may also be used in the diagnostic and therapeutic methods 
contemplated herein. In a specific embodiment, an AGR2 mutein carries an amino 

15 acid substitution at residue 137, as shown in SEQ ID NO:30. 

Uses of the AGR2 muteins as modulators (whether agonists or 
antagonists) of endogenous AGR2 activity are also contemplated. Consequently, 
pharmaceutical compositions comprising the AGR2 muteins of this invention are 
contemplated further comprising a pharmaceutical^ acceptable carrier. 

20 Specifically we contemplate use of the AGR2 muteins of the present invention, 
the polynucleotide encoding them and vectors bearing the polynucleotides for the 
prevention, treatment or amelioration of a medical condition in a mammalian 
subject, particularly a human subject, and in particular their use for the 
development of a measure for the prevention, treatment or amelioration of any 

25 medical conditions characterized by goblet cell abnormalities or mucus 
production. 

One embodiment of the invention is related to a method for 
modulating the expression of a target gene in a eukaryotic cell when the target 
gene is T e gu la led "by- the^AGR2^prot ein rT he to ete ^inVrtv^th^tgp^O'^ 
30 modulating the activity of AGR2, i.e., of the wild type AGR2 or the AGR2 
mutein. While the method may be used on single cells, it is preferable to apply the 
method to a eukaryotic cell within a multicellular organism, for example, in a 
mammal such as a human, horse, dog, cat, sheep, rat, or a mouse, but also in other 
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vertebrates, such as amphibians, e.g., in Xenopus leavis. The eukaryotic cell 
within the above multicellular organisms may be a cell that expresses AGR2, in 
particular a goblet cell or a mucus secreting cell of, e.g., the Brunner's gland or 
the submucosal glands of the trachea. 
5 Another embodiment of the invention is related to a method for 

modulating the expression, in a cell of a mammal, of a target gene whose 
transcription is regulated by AGR2 protein. In the method, the activity of AGR2 is 
modulated, i.e., the activity of the wild type or mutein AGR2, and the modulated 
AGR2 will, in turn, modulate the expression of the target gene. The method will 

10 work on all animals, for example in mammals such as human, horse, dog, cat, 
sheep, rat, or mouse, but also in other vertebrates, such as amphibians, e.g., 
Xenopus leavis. The method is particularly useful in cells which express AGR2, 
such as goblet cells or mucus secreting cells of, e.g., the Brunner's gland or the 
submucosal glands of the trachea. 

15 The activity of AGR2, i.e., of the wild type or mutein AGR2, may 

be modulated in a number of ways, such as, for example, altering the state of 
posttranslational modification. For example, if the target gene is responsive to a 
phosphorylated AGR2, the phosphorylation state of AGR2 protein may be 
increased to increase the activity of the target gene. Conversely, if it is desired to 

20 reduce the activity of the gene, the phosphorylation state of AGR2 may be 
decreased. 

As another example, if the target gene is responsive to a 
dephosphorylated state of AGR2, the phosphorylation state of AGR2 is decreased 
to increase the activity of the target gene. In this case, the phosphorylation state of 

25 AGR2 may be increased to reduce the activity of the target gene. 

The modulation may involve both an increase of AGR2 activity, 
i.e., of the wild type or mutein AGR2, or a decrease of AGR2 activity, i.e., of the 
wild type or mutein AGR2. Any method that can increase or decrease AGR2 
activity may be used. For example, AGR2 may be decreased by contacting an 

30 AGR2 expression inhibitor with an AGR2 mRNA to prevent protein translation or 
promote mRNA decay. The AGR2 expression inhibitor may be a biomolecule 
such as a nucleic acid. For example, the nucleic acid may be an antisense nucleic 
acid (DNA, RNA, PNA or other synthetic nucleic acid analogs), an siRNA 
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molecule, or an aptamer. The nucleic acid may be a ribozyme specific for AGR2 
mRNA. In all cases where a nucleic acid is used, the nucleic acid may be designed 
to differentiate between a nucleic acid encoding a mutated protein from a wild 
type nucleic acid. For example, the ribozymes and antisense nucleic acids may be 
5 designed to hybridize in a sequence specific manner to the sequence encoding the 
mutated AGR2 but not to the sequence encoding the wild type AGR2. 

Alternatively, the ribozymes and antisense nucleic acids may be 
designed to hybridize in a sequence specific manner to the sequence encoding the 
wild type AGR2 but not to the sequence encoding the mutated AGR2. 

10 As a further example, a ribozyme discussed above may be 

comprised of a hybridizing region and a catalytic region. A ribozyme designed to 
affect AGR2 expression will, naturally, contain a hybridizing region that is 
capable of hybridizing to at least part of a AGR2 mRNA sequence. Further, the 
ribozyme would contain a catalytic domain capable of cleaving the AGR2 mRNA 

15 sequence to reduce or inhibit AGR2 gene expression. The hybridizing region may 
be constructed to hybridize only to a sequence encoding a mutated AGR2 and not 
to a sequence encoding wild type AGR2. Alternatively, the hybridizing region 
may be constructed to hybridize only to a sequence encoding a wild type AGR2 
and not to a sequence encoding mutant AGR2, i.e., the hybridization region does 

20 not comprise a part of the AGR2 rautein sequence encompassing the mutation. 
Conversely, the hybridizing region may be constructed to hybridize to all 
sequences encoding AGR2 regardless of whether the protein is wild type or 
mutant. 

In another embodiment, the biomolecule, discussed above, may be 
25 a protein. The protein may be an antibody, a fragment of an antibody, or an 
anticalin. These antibody and antibody fragments may show specificity in binding 
the AGR2 protein, i.e., the wild type or mutein AGR2. While antibodies and 
antibody fragments with high specificity are preferred, lower specificity 
antibodies and fragments are also contemplated by this invention. A lower 
30 specificity antibody or fragment may be useful, for example, if the antibody does 
not interfere with other cellular functions. 

Preferably, the specificity of the antibodies and antibody fragments 
is sufficient so that they do not bind any other protein in the cell. High specificity 

8 
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may be achieved by using monoclonal antibodies. Methods for making 
monoclonal antibody are well known. Other methods for making polyclonal 
antibodies, such as, for example, by injection into animals are also known. High 
specificity polyclonal antibodies may be produced, for example, by using a 
column bound with proteins from a cell not expressing AGR2 (i.e., column 
chromatography) of polyclonal antibodies. Such a column would remove 
nonspecific antibodies. Other techniques for purifying antibodies are known in the 
art. 

Another embodiment of the invention is related to a mutant AGR2 
polypeptide, comprising, e.g., an amino acid substitution at the position 
corresponding to residue 137 of SEQ ID NO:2. The polypeptide may contain at 
least 6 amino acids, preferably at least 7 amino acids, more preferably at least 8 
amino acids, even more preferably at least 9 amino acids and most preferably at 
least 10 amino acids. Longer peptides, such as, for example, the complete AGR2 
protein containing an amino acid substitution at position 137 are, of course, 
contemplated because the complete protein is longer than the limit of at least 6, 7, 
8, 9 or 10 amino acids stated above. 

The amino acid substitution is the substitution of a codon encoding 
valin at position 137 to a codon encoding a non-valin substitution. The genetic 
code is known so the types of substitution claimed are known to one of skill in the 
art. One example of substitution may be one in which valin is substituted by an 
acidic amino acid such as glutamic acid or aspartic acid. Another example of 
substitution may be one in which valin is substituted by a glycine or proline. 
Another example of substitution may be one in which valin is substituted by a 
basic amino acid (histidin, arginin or lysin), aliphatic hydroxyl side chain amino 
acid (serine, threonine), aromatic side chain amino acid (phenylalanine, tyrosine, 
tryptophan), amide side chain amino acid (asparagine, glutamine), sulfur 
containing side chain amino acid (cysteine, methionine) or aliphatic side chain 
amino acid (alanine, leucine or isoleucine). 

One embodiment of the invention is related to a nucleic acid 
segment that encodes a polypeptide fragment of AGR2 where the polypeptide 
fragment comprises an amino acid substitution corresponding to residue 137 of 
the full length AGR2. The amino acid substitution may be the replacement of the 
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codon encoding residue 137 with any codon that do not encode valin. A codon 
that does not encode valin may be, for example, a codon that encode Phe (TTT, 
TTC); Leu (TTA, TTG, CTT, CTC, CTA, CTG); He (ATT, ATC, ATA); Met 
(ATG); Ser (TCT, TCC, TCA, TCG), Pro (CCT, CCC, CCA, CCG); Thr (ACT, 
5 ACC, ACA, ACG), Ala (GCT, GCC, GCA, GCG); Tyr (TAT, TAC); His (CAT, 
CAC), Asp (GAT, GAC); Gin (CAA, CAG); Asn (AAT, AAC); Lys (AAA, 
AAG); Glu (GAA, GAG); Cys (TGT, TGC); Trp (TGG); Arg (CGT, CGC, CGA, 
CGG, AGA, AGG); Ser (AGT, AGC); or Gly (GGT, GGC, GGA, GGG). Of all 
the substitutions stated above, a nucleic acid that encodes a substitution of valin to 
10 glutamic acid (GAA, GAG) at codon 137, as shown in SEQ ID No:2 is most 
preferred. 

The nucleic acid of the invention may be part of a recombinantly 
generated episomal element. Episomal elements may be, for example, a plasmid, 
cosmid, bacterial phage nucleic acid, or a viral nucleic acid. The recombinantly 

15 generated nucleic acid may be a part of a genome, such as a bacteriophage 
genome, a bacteria genome, or virus genome. Virus genomes may be a DNA viral 
genome, or an RNA viral genome (both + strand virus or - strand virus). 

In another embodiment, the invention is related to vectors 
comprising a nucleic acid segment that encodes a polypeptide fiagment of AGR2 

20 where the polypeptide fragment comprises an amino acid substitution 
corresponding to residue 137 of the full length AGR2. The vector may be an 
expression vector, a mutagenesis vector, an integration vector or a mutation 
vector. Expression vectors are well known in the art and include plasmid vectors, 
cosmid vectors, phage vectors, phagemid vectors, viral vectors, retroviral vectors, 

25 and the like. 

The invention also contemplates a host cell transfected with one of 
the vectors and nucleic acids described above. A host cell may be, for example, a 
eukaryotic cell or a prokaryotic cell. A host cell transformed with a nucleic acid 
that is not a vector may be, for example, a cell transformed with antisense DNA or 
30 aribozyme. 

Another embodiment of the invention is related to a method of 
producing a mutant AGR2 protein. In the method, a host cell transfected with a 
nucleic acid that encodes a polypeptide fragment of AGR2 where the polypeptide 
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fragment comprises an amino acid substitution corresponding to residue 137 of 
the full length AGR2 is cultured such that the nucleic acid is expressed. It should 
be noted that an expression vector may be desirable but is not required. For 
example, in transient expression, vector sequences are not required for expression. 
5 The cultured cells are then harvested and the mutant AGR2 protein is purified 
from the cells. While purification to homogeneity may be desirable, it is not 
necessary. Purification may involve merely making a lysate from bacteria that 
expressed AGR2. In this example, the AGR2 protein is purified because it is no 
longer associated with the proteins it was naturally associated with (i.e., 
10 eukaryotic proteins). As another example, a mouse Agr2 protein expressed in a 
human cell is also purified because it is no longer associated with the proteins 
(mouse proteins) that it is naturally associated. 

Another embodiment of the invention is related to a composition 
for inducing an altered condition in a patient The composition may comprise a 
15 mutant AGR2 polypeptide containing a substitution mutation that corresponds to 
residue 137 or any other AGR2 mutein described herein. Examples of wild type 
AGR2 proteins are shown in SEQ ID NO:3, or SEQ ID NO:4. Thus, a polypeptide 
with a substitution mutation in codon 137 of SEQ ID NO:3 or SEQ ID NO:4 may 
be an ingredient in the composition. The substitution mutation may be the 
20 substitution of valin at position 1 37 with a non valin amino acid. The composition 
may also comprise a wild type AGR2 protein, e.g., a protein according to SEQ ID 
NO:4. In addition, the composition may contain a pharmaceutical^ acceptable 
carrier. 

Another embodiment of the invention is related to a method of 
selectively inhibiting the expression, in a eukaryotic cell of a gene whose 
transcription is negatively or positively regulated by AGR2. The eukaryotic cell is 
preferably a mammalian cell, preferably a cell derived from a human, horse, dog, 
cat, sheep, rat, or a mouse, but also derived from other vertebrates, such as 
amphibians, e.g., from Xenopus leavis. The method is also related to cells within 
the afore-mentioned animals, preferably within a human. The eukaryotic cell may 
be a cell that itself expresses AGR2, in particular a goblet cell or a mucus 
secreting cell of, e.g., the Brunner's gland or the submucosa of the trachea. 
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Another embodiment of the invention is related to a method for 
expressing an AGR2 protein with alterered activity. In the method, a host cell with 
an episomal element that comprises a cDNA which encodes AGR2 protein with, 
e.g., a substitution mutation, wherein the mutation is a substitution of valin at 
5 position 137 with an amino acid that is not valin is provided. Then the host cell is 
cultured such that the mutant AGR2 protein is expressed. 

Another embodiment of the invention is related to an antisense 
nucleic acid molecule of a length sufficient to inhibit the expression of an AGR2 
protein, i.e., a wild type AGR2 protein or an AGR2 mutein. An antisense nucleic 
10 acid molecule sufficient to inhibit total cellular AGR2 protein biological activity 
is also contemplated. The antisense nucleic acid molecule is complementary to a 
mammalian AGR2 nucleic acid sequence such as human AGR2 sequence, mouse 
Agr2 sequence, or rat AGR2 sequence. The biological activity to be inhibited may 
be goblet cell function, e.g., mucus production, or the proliferation of mucus 
15 secreting cells of, e.g., the glandular epithelium of the Brunner's gland. The 
activity may be inhibited by at least 5%, 10%, 15%, 20%, 25%, 50%, 75% or 
100%. The antisense nucleic acid may be at least 15 nucleotides in length. 

Another embodiment of the invention is related to a ribozyme. The 
ribozyme comprises a hybridizing region and a catalytic region. The hybridizing 
20 region is capable of hybridizing to at least part of a target mRNA sequence 
transcribed from a genomic AGR2 sequence and the catalytic domain is capable 
of cleaving the target mRNA sequence to reduce or inhibit AGR2 function, i.e., 
the function of a wild type AGR2 protein or an AGR2 mutein. 

Another embodiment of the invention is related to an siRNA 
25 molecule. The siRNA molecule is designed in a way to efficiently inhibit the Agr2 
gene expression, i.e., the gene expression of the wild type AGR2 or the AGR2 
mutein, by gene silencing. 

A further embodiment of the invention is related to an aptamer. 
The aptamer is designed in a way to efficiently bind AGR2, i.e., the wild type 
30 AGR2 or the AGR2 mutein. Preferably, the specificity of the aptamers is 
sufficient so that they do not, or substantially do not, bind to any other protein in 
the cell. 
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Another embodiment of the invention is related to a pharmaceutical 
composition, which comprises a nucleic acid molecule that inhibits or otherwise 
reduces AGR2 mediated function, i.e., wild type AGR2 or AGR2 mutein function. 
The nucleic acid is at least about ten nucleotides in length and hybridizes to an 
5 AGR2 mRNA molecule or forms a heteroduplex with a AGR2 mRNA molecule. 
The nucleic acid molecule may be an antisense molecule or an siRNA molecule. 
The pharmaceutical composition, in addition to the nucleic acid described, further 
comprises one or more pharmaceutical^ acceptable carriers. 

Another embodiment of the invention is related to a transgenic 
10 non-human mammal all of whose germ cells and somatic cells contain a mutated 
AGR2 gene, which was introduced into the mammal, or one of its ancestors, at an 
embryonic stage. The transgene - a mutated AGR2 gene - encodes, e.g., an 
amino acid substitution mutation at the position corresponding to amino acid 137 
of the AGR2 protein. 

15 The mutated AGR2 protein of the transgenic mammal above may 

be derived from a wild type AGR2 protein sequence. Wild type AGR2 proteins 
are listed in SEQ ID NO:3 or SEQ ID NO:4. A mutated version of the AGR2 
protein would contain the sequence of SEQ ID NO:3 or SEQ ID NO:4 but with a 
substitution mutation at, e.g., amino acid 137. The substitution mutation may be 

20 the substitution of valin at position 137 with a non-valin amino acid. The 
transgenic mammal may further contain a knockout wild type AGR2 gene. 
Furthermore, the knockout wild type AGR2 gene may be homozygous such that 
the transgenic animal contains no wild type AGR2. In this case, the only AGR2 
gene in the transgenic animal is the mutated AGR2 gene. Naturally, since the only 

25 AGR2 gene is the mutated one, the only AGR2 protein in the transgenic animal is 
the mutated AGR2 protein. There are multiple methods of constructing an animal 
with knockout endogenous AGR2 and a functional mutant AGR2. One method is 
to knockout both endogenous AGR2 genes by homologous recombination. An 
easier method may be to knockout one of the endogenous AGR2 gene and breed 

30 this knockout AGR2 locus to homozygosity. The introduction of a mutant AGR2 
gene may be part of the knockout construction. That is, the genetic construct 
designed to target the endogenous AGR2 gene may itself contain a mutant AGR2 
gene. Thus, the gene knockout and the introduction of a mutant AGR2 gene may 
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be performed concomitantly. Alternatively, a knockout animal line (homozygous 
or heterozygous) may be used to produce transgenic animals using a mutated 
AGR2 DNA construct. Finally, a knockout AGR2 animal line may be crossed 
with a transgenic animal carrying a mutant AGR2 gene. Animals homozygous for 
AGR2 knockout and for carriers of a mutant AGR2 can be made using standard 
genetic techniques. 

In the cases where the mutant AGR2 gene construct is used to 
produce a transgenic animal, the gene construct may further comprise a promoter 
sequence different from the promoter sequence controlling the transcription of the 
endogenous AGR2 coding sequence. Thus, mutant AGR2 may be expressed in 
any desired tissue depending on the choice of promoter sequence. Further, the 
promoter sequence may be from an inducible promoter. While the transgenic non- 
human mammals of this invention may be any mammal, one preferred animal is a 
rodent such as a rat or a mouse. 

Another embodiment is related to the use of the nucleic acids of the 
invention for in vivo delivery and expression. This approach has also been called 
gene therapy. It should be noted that to be useful, gene therapy does not need to be 
completely efficacious. A method of gene therapy that can alleviate a symptom of 
a mammalian disorder is envisioned by the instant disclosure. Gene therapy is 
known in the art. This term has been used to describe a wide variety of methods 
using recombinant biotechnology techniques to deliver a variety of different 
materials to a cell. Such methods include, for example, the delivery of a gene, 
antisense KNA, an siRNA molecule, an aptamer, a cytotoxic agent, etc., by a 
vector to a mammalian cell, preferably a human cell either in vivo or ex vivo. Most 
work has focused on the use of viral vectors to transform these cells. This focus 
has resulted from the ability of some viruses, to infect cells and have their genetic 
material integrated into the host cell with high efficiency. Viruses useful for this 
approach include retroviruses, adenoviruses, pox viruses (including vaccinia), 
herpes virus, etc. In addition, various non-viral vectors such as ligand-DNA- 
conjugates have been used. Transient expression of transgenes has been developed 
also by the use of non-integrative viral vectors with low replicative efficiency. 

Other embodiments of the invention are related to the use of the 
nucleic acids and proteins as described herein to alter or modulate, in a cell of a 
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mammal, the expression or activity of AGR2, i.e., the AGR2 wild type protein or 
mutein; or to their use to alter or modulate the expression of a target gene whose 
transcription is directly or indirectly regulated by AGR2 protein. 

The use described above, when applied to an animal such as a 
5 mammal (e.g., a human) have significant medicinal value. Thus, another 
embodiment of the invention is related to the use of the proteins and nucleic acids 
as described herein as a medicament. The medical composition may be used to 
prevent, to ameliorate, or to treat a disease such as asthma, chronic obstructive 
pulmonary disease (COPD), cystic fibrosis, dry eye syndrome, gastric disease, 
peptic ulcer, inflammatory bowel disease and malignancies like colorectal cancer. 
The medical condition or disease may optionally furthermore be associated with 
an increased proliferation of the glandular epithelium of the Brunner's gland. 

The proteins (i.e., all proteins described including AGR2 wild type 
or mutein, antibodies and other proteins), chemical molecules, including small 
molecules, e.g., small molecule agonists or small molecule antagonists, and 
nucleic acids of the invention may be applied to a patient using well known 
delivery methods as described infra. The medicament may be used for the 
modulation of goblet cell function. The compositions and medicament of the 
invention may be used to alter the biological activity of AGR2, i.e., the AGR2 
wild type protein or mutein. 

Further embodiments of the invention relate to the use of the 
vectors, episomal elements and/or host cells as described herein for prevention, 
amelioration, or treatment of those diseases associated with goblet cell activity or 
deficiency, such as asthma, chronic obstructive pulmonary disease (COPD), cystic 
fibrosis, dry eye syndrome, gastric disease, peptic ulcer, inflammatory bowel 
disease and malignancies like colorectal cancer and the use of the non-human 
animal model of the invention for the dissection of the molecular mechanisms 
physiological processes within which AGR2 is active, or which are influenced by 
AGR2. 

Further embodiments include the use of the non-human animal 
model of the invention for the identification of gene and protein diagnostic 
markers for diseases, or for the identification and testing of compounds useful in 



15 



WO 2004/056858 



PCT/EP2003/014834 



the prevention, amelioration, or treatment of those diseases associated with AGR2 
activity or deficiency, as described herein. 

The above embodiments and yet further embodiments of the 
present invention will be explained in more detail below. 

5 

DESCRIPTION OF THE FIGURES 

Figure 1 depicts the synthenic chromosomal regions of mouse and human 
bearing the AGR2 genes of both species (Fig.lA), and a 
comparison of the exon-intron structure (Fig. IB) of murine and 
human AGR2. Only coding exons are coloured in grey. Exon sizes 
are indicated by the number of basepairs either top of an exon (if 
coding exon) or below an exon (if non-coding exon). Intron sizes 
are depicted in length by basepairs. 

Figure 2 depicts an alignment of the murine and human wild type AGR2 
protein sequences, indicating the amino acid residues identity 
between the two proteins. The position of the mutation is 
highlighted in grey. 

Figure 3 depicts a chart diagramming the F3-production (Fig, 3A) and the 
outcross breeding schemes (Fig. 3B) used to map the mutation, 
associated with the observed phenotypic abnormalities, to mouse 
chromosome 12. Legend: thin parallel lines represent the two 
alleles of the genome, crossed thin lines represent mutation events; 
thick lines represent the wild type of a different mouse strain used 
for outcrossing. 

m WT indicates a male wild type; 
f WT indicates a female wild type; 
DB1 indicates a dominant breeding 1; 
RF1 indicates a recessive Fl x Fl; 
RBS indicates a recessive brother-sister; 
ROC indicates a recessive out-cross; 
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RIC indicates a recessive inter-cross. 

Abbreviations in miniscules indicate the animal involved in each 
breeding stage, their names indicating the stages from which they 
were generated. 

depicts final data from genome wide SNP analysis on affected F5 
MTZ mice leading to the assignment of the mutation to proximal 
chromosome 12, as performed by Pyrosequencing Technology. 

depicts a haplotype scheme of informative MTZ mice with 
chromosomal breakpoints defining the location of the mutation at 
chromosome 12 between marker Idb2 and marker D12Mit64. The 
symols "c", "hz" and "b" respectively, indicate C3H (c) mice, 
heterozygous (hz) mice, and c57B16 (b) mice, respectively. 

depicts data from a reverse transcribed polymerase chain reaction 
(RT-PCR) analysis, examining murine AGR2 mRNA expression at 
murine tissue cDNAs. The 349 bp band represents the PCR product 
specific for murine AGR2. 

depicts data from a reverse transcribed polymerase chain reaction 
(RT-PCR) analysis, examining human AGR2 mRNA expression at 
human tissue cDNAs. The 170 bp band represents the PCR product 
specific for human AGR2. 

depicts Northern blots hybridized with a human AGR2 probe. 

depicts a table listing genotypes and phenotypes of mice 
descending from the MTZ mouse originally identified in the 
genome wide mutagenesis screen. Mice carrying the missense 
mutation of the Agr2 gene on both alleles are marked as "muf \ 
whereas those carrying one mutated and one wild type allele are 
marked as "het". Mice carrying two wild type alleles at the Agr2 
locus are marked as "wt" All mice carrying the missense mutation 
of the Agr2 gene on both alleles display the MTZ phenotype, i.e. 
chronic diarrhea and reduced thriving, whereas all other mice were 
phenotypically normal. 
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Figure 10 depicts cross sections of the colon walls of a wild type mouse in 
C3H genetic background. The samples were formalin fixed and 
stained with anti-TFF3 (trefoil peptide 3) antibody and anti-murin 
Agr2 antiserum, respectively - indicating TFF3 and Agr2 
5 expression in goblet cells. 

Figure 1 1 depicts a cross section of the colon walls of an MTZ mouse in the 
C3H genetic background, and a respective wild type mouse used as 
a control. The samples were formalin fixed and stained with H7E 
(hematoxilin/eosin). In the wild type animal, goblet cells are 
10 characterized by their high content of vesicles storing pre-mucins 

and other components of mucus, which appear as light spherical 
droplets in the present staining. These droplets are almost absent in 
the colon epithelium of the MTZ animal. 

Figure 12 depicts a cross section of the colon walls of an MTZ mouse in the 
15 C3H genetic background. The samples were formalin fixed and 

stained with H/E (hematoxilin/eosin). The colon wall of the MTZ 
animal contains infiltrating inflammatory immune cells in the 
mucosal epithelium and submucosa, which are identifiable by their 
small size and the dark staining spherical nucleus (marked by an 
20 asterisk. In addition, microerosion of colonic mucosa is detected 

and marked by an arrow. 

Figure 13 depicts a cross section of the colon walls of an MTZ mouse in the 
C3H genetic background and a respective wild type mouse used as 
a control The samples were formalin fixed and stained with the 

25 flurescent labeled lectins wheat germ agglutinin (WGA), and with 

a Dolichos biflorus agglutinin (DBA). In the wild type animal, 
highly glycosylated mucins are identifiable by their light staining, 
which concentrates in spherical droplets stored by goblet cells. In 
contrast, these light staining droplets are almost absent in the colon 

30 epithelium of the MTZ animal. 

Figure 14 depicts a cross section of the duodenal wall of an MTZ mouse in 
the C3H background and a respective wild type mouse as a control. 
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The samples were formalin fixed and stained with H/E 
(hematoxilin/eosin). In the wild type animal, a normal Brunner's 
gland as well as normal duodenal epithelium are detected. In the 
MTZ animal the Brunner's gland is dilated and the duodenal 
5 epithelium is proliferating. In the MTZ animal the Brunner's gland 

is dilated and the duodenal epithelium is proliferating. A Brunner's 
gland is indicated by an asterisk, a duodenal epithelium is indicated 
by an arrow. 

Figure 15A depicts the results when applying the amino acids 1 to 30 from 
10 mouse Agr2 to the publicly available program "SignalP VI. 1" 

(Nielsen et al., 1997). The program predicts an N-terminal signal 
sequence encoded by the amino acids 1 to 20 and a cleavage site 
between amino acid 20 and 2 1 with a high probability. 

Figure 15B depicts the results when applying the amino acids 1 to 30 from 
15 human AGR2 to the publicly available program "SignalP VI. 1" 

(Nielsen et al., 1997). The program predicts a N-terminal signal 
sequence encoded by the amino acids 1 to 20 and a cleavage site 
between amino acid 20 and 21 with a high probability. 

Figure 16 depicts the comparison of the amino acid sequences of mouse, 
20 human, and rat Agr2 proteins. Amino acid identity of 91%, and 

amino acid similarity of 95% indicate evolutionary highly 
conserved amino acid residues. The conserved amino acids (i.e., 
identical or similar) are listed in accompanying Table L 

Figure 17 depicts the comparison of the amino acid sequences of mouse, 
25 human, rat, and Xenopus laevis Agr2 proteins. Amino acid identity 

of 67%, and amino acid similarity of 82% indicate evolutionary 
highly conserved amino acid residues. The conserved amino acids 
(i.e., identical or similar) are listed in accompanying Table 2. 

Figure 18 depicts the comparison of the amino acid sequences of mouse, 
30 human, rat, Xenopus laevis, and C. elegans Agr2 proteins. Amino 

acid identity of 32%, and amino acid similarity of 46% indicate 
evolutionary highly conserved amino acid residues. The conserved 
19 
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amino acids (i.e., identical or similar) are listed in accompanying 
Table3. 

Figure 19 depicts data from quantitative mKNA detection by PCR-Light 
Cycler technology on freshly prepared colon cDNA of MTZ and 

5 wild type control newborns. Elevated amount of Agr2 transcript is 

accompanied by reduced amounts of muc2 (mucin 2) and TFF3 
transcript Both genes, Muc2 and TFF3 encode proteins that 
comprise the major components of mucus. Same data have been 
established in assays with colon cDNA of adult MTZ and wild type 

10 control mice. Regulation of mRNA was determined as x fold 

change relative to the transcript amount of internal standard gene 
ALAS (aminolevulinic acid synthase 1). 

Figure 20 depicts Western blot data indicating secretion of AGR2 protein into 
the supernatant conditioned from colon cancer cell lines. 
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DETAILED DESCRIPTION OF THE INVENTION 

The various aspects and utilities of the present invention will be 
apparent from the following detailed description. 

20 The goblet cells referred to herein are cells, which are specialized 

with respect to mucus secretion via granules, in particular in the gastrointestinal 
tract (GI) (examples in this regard are goblet cells of the esophagous, of the 
stomach surface, of the pyloric glands, and of the intestinal epithelium), or in the 
respiratory tract (examples in this regard are goblet cells of the nose epithelium, of 

25 the trachea, of the bronchius, and of the submucosal glands of the trachea). 

The term "differentiation" as used herein in connection with goblet 
cells refers to all steps of cellular differentiation of a goblet cell from early 
differentiation to late differentiation and to terminal differentiation, i.e., to the 
mature mucus secrecting goblet cell. Thus, terminal differentiation of goblet cells 

30 means the last differentiation step to the mature goblet cell. 
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The term "mucus secreting cell" as used herein refers to cells which 
are specialized to mucus secretion without prior storage of the mucus in granules, 
e.g., the mucus secreting cells of the Brunner's gland. 

5 Animal Model and its Uses 

The present invention provides, for example, a non-human 
vertebrate animal expressing an AGR2 protein which is modified compared to the 
amino acid sequence of the wild type protein at amino acid position 137. The 
animal may be a mammalian animal, preferably a rodent, in particular from a 

10 genus such as Mus (e.g. mice), Rattus (e.g. rats), Oryctologus (e.g. rabbits) and 
Mesocricetus (e.g. hamsters). In a particularly preferred embodiment the animal is 
a mouse. However, dogs, cats, sheep, and horses are likewise suitable in 
connection with the invention. The same applies to vertebrates such as 
amphibians, in particular Xenopus laevis. 

15 The term "modified" as used herein in connection with the AGR2 

protein and nucleic acids relating thereto refers to an alteration compared to the 
wild type AGR2, e.g., the wild type AGR2 proteins according to SEQ ID NO:3 or 
SEQIDNO:4. 

The term "phenotype" as used herein refers to a collection of 
20 morphological, physiological, behavioral and/or biochemical traits possessed by a 
cell or organism that result from the interaction of the genotype and the 
environment Thus, the non-human vertebrate animal of the present invention 
displays readily observable abnormalities compared to the wild type animal. In a 
preferred embodiment the animal of the invention shows at least 1, preferably at 
25 least 2, and most preferably at least 4 abnormal phenotypical features, preferably 
selected from all of the above categories. 

More generally, the non-human vertebrate animal according to the 
present invention comprises in the genome of at least some or all of its cells an 
allele of a gene encoding a protein having at least 65%, at least 70%, at least 75%, 
30 at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% 
amino acid identity compared to the mouse Agr2 or the human AGR2 protein 
according to SEQ ID NO:3 and SEQ ID NO:4, respectively. 
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The following definitions apply to any reference to nucleic acid or 
amino acid sequence identity throughout the present specification. The term 
"sequence identity" refers to the degree to which two polynucleotide or 
polypeptide sequences are identical on a residue-by-residue basis over a particular 

5 region of comparison. The phrases "percent amino acid identity" or "% amino 
acid identity" refer to the percentage of sequence identity found in a comparison 
of two or more amino acid or nucleic acid sequences. Percent identity can be 
readily determined electronically, e.g., by using the MEGALIGN program 
(DNASTAR, Inc., Madison Wis.). The MEGALIGN program can create 

10 alignments between two or more sequences according to different methods, one of 
them being the clustal method. See, e.g., Higgins and Sharp (Higgins and Sharp, 
1988). The clustal algorithm groups sequences into clusters by examining the 
distances between all pairs. The clusters are aligned pairwise and then in groups. 
The percentage similarity between two amino acid sequences, e.g., sequence A 

15 and sequence B, is calculated by dividing the length of sequence A, minus the 
number of gap residues in sequence A, minus the number of gap residues in 
sequence B, into the sum of the residue matches between sequence A and 
sequence B, times one hundred. Gaps of low or of no homology between the two 
amino acid sequences are not included in determining percentage similarity. 

20 A particularly preferred method of determining amino acid identity 

between two protein sequences for the purposes of the present invention is using 
the "Blast 2 sequences" (bl2seq) algorithm described by Tatusova et al. (Tatiana 
A. Tatusova, Thomas L. Madden (1999), "Blast 2 sequences - a new tool for 
comparing protein and nucleotide sequences", FEMS Microbiol Lett. 174:247- 

25 250). This method produces an alignment of two given sequences using the 
"BLAST" engine. On-line access of "blasting two sequences" can be gained via 
the NCBI server at http://www.ncbi.nlm.nih.gov/blast/bl2seq^l2.html. The stand- 
alone executable for blasting two sequences (bl2seq) can be retrieved from the 
NCBI ftp site (ftp://ftp.ncbi.nih.gov/blast/executables). Preferrably, the settings of 

30 the program blastp used to determine the number and percentage of identical or 
similar amino acids between two proteins were the following: 
Program: blastp 
Matrix: BLOSUM62 
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Open gap penalty: 1 1 

Extension gap penalty: 1 

Gap x_dropo£f: 50 

Expect: 10.0 

5 Word size: 3 

Low-complexity filter: on 



For the purposes of the present specification, a reference to percent 
amino acid sequence identity means in a preferred embodiment percent identity as 
determined in accordance with the blastp program using the above settings. 

W The protein mentioned above may be, for example, the 

corresponding orthologue of the mouse Agr2 or the human AGR2 protein 
according to SEQ ID NO:3 and SEQ ID NO:4 with respect to the animal. It may 
also be a variant of the mouse Agr2 or the human AGR2 protein according to SEQ 
ID NO:3 and SEQ ID NO:4, or of said orthologue, allelic or otherwise, wherein 

15 certain amino acids or partial amino acid sequences have been replaced, added, or 
deleted. 

In a preferred embodiment, the genome of the cells of the animal 
comprising said allele does not additionally comprise more than one functional 
allele representing a wild type AGR2 gene, for example the corresponding wild 
20 type orthologue with respect to the animal, or a wild type AGR2 gene that is 
heterologous with respect to the genomic DNA of the cells. It is particularly 
preferred that the genome of the above cells does not additionally comprise any 
functional allele representing a wild type AGR2 gene (i.e., no functional allele of 
the corresponding wild type orthologue, or of a heterologous wild type AGR2 
gene). 

The above-mentioned mutated allele comprised in the genome of 
the cells of the non-human vertebrate animal comprises a mutation which, if 
present in the genome of all or essentially all cells of said animal in a homozygous 
manner, in particular in the animal's goblet cells, results in a phenotype associated 
with an alteration in goblet cell function compared to the corresponding wild-type 
animal. It will be appreciated that this mutation may reside in either the coding or 
the non-coding region of the allele. 
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The above-mentioned phenotypes may be characterized by an 
alteration in goblet cell differentiation, particularly terminal differentiation, or an 
alteration in goblet cell mucus production or secretion. They may also be 
characterized by an alteration in mucus composition, e.g., in respect of the levels 
5 of typical mucus constituents, e.g., mucin2 (muc2) or trefoil peptides. Such 
phenotypes may also be characterized by any combination of these phenomena. 

A typical phenotype of a non-human vertebrate animal in this 
regard is one characterized by a reduction in pre-mucin storing granules in the 
goblet cells, an altered mucus secretion, secondary inflammatory infiltrations in 
10 the intestinal mucosal epithelium and submucosa. The phenotype of the non- 
human vertebrate animal as described herein may optionally be furthermore 
associated with an increased proliferation of the glandular epithelium of the 
Brunner's gland. 

The phenotype of the non-human vertebrate animal according to 

15 the present invention may further be characterized by reduced transcription levels 
of the late differentiation markers Muc-2 and TFF3 in goblet cells. 

Furthermore, a typical phenotype of a non-human vertebrate 
animal according to the present invention is one wherein the alteration results in 
diarrhea, or diarrhea and a thriving deficit. 

20 In another non-human vertebrate animal according to the present 

invention the mutated allele contains a mutation corresponding to a mutation in 
the mouse Agr2 protein or the human AGR2 protein according to SEQ ID NO:3 
and SEQ ID NO:4, respectively, which leads to an altered biological activity of 
the mutated protein when compared to the corresponding wild type mouse Agr2 

25 protein or human AGR2 protein in an in vitro assay. 

The term "corresponds to" as used in this regard and throughout 
the present specification means that the mutated allele reflects the mutation in the 
mouse Agr2 protein or the human AGR2 protein according to SEQ ID NO:3 and 
SEQ ID NO:4 on the amino acid level. Where the sequences of the allele flanking 

30 the mutation do not encode amino acids identical to those at the corresponding 
positions in the amino acid sequences of the mouse Agr2 or the human AGR2 
protein defined above, the skilled artisan will be readily able to align the amino 
acid sequences encoded by the flanking sequences with the corresponding amino 
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acids of the mouse Agr2 or the human AGR2 protein, preferably by using the 
above-mentioned method of determining amino acid sequence identity, and 
determine whether a mutation in the mouse Agr2 protein or the human AGR2 
protein of the kind mentioned above is reflected by the amino acid sequence 
5 encoded by said allele. In case of an amino acid substitution or insertion, the 
mutation is preferably reflected by the amino acid sequence encoded by the allele 
in such a way that an identical amino acid or amino acid sequence is found at the 
corresponding position of the protein encoded by the allele. In case of an amino 
acid deletion, the mutation is preferably reflected by the amino acid sequence 

10 encoded by the allele in such a way that an identical or corresponding amino acid 
or amino acid sequence is deleted at the corresponding position of the protein 
encoded by the allele. 

The term "altered biological activity in an in vitro assay" as used 
above in connection with the reference to the in vitro assay and throughout the 

15 present specification refers either to an increased or a decreased biological 
activity. The increase in biological activity is preferably an at least 10%, 20%, 
30%, 40%, 50%, 70%, 80%, 90%, or a 100% or an even higher increase as 
compared to the wild type mouse Agr2 protein or human AGR2 protein according 
to SEQ ID NO:3 and SEQ ID NO:4, respectively. Likewise, the decrease in 

20 biological activity is preferably an at least 10%, 20%, 30%, 40%, 50%, 70%, 80%, 
or 90% decrease, or an even complete abolishment of biological activity as 
compared to the wild type mouse Agr2 protein or human AGR2 protein according 
to SEQ ID NO:3 and SEQ ID NO:4, respectively. Since the increase or decrease 
in biological activity are determined by comparing mouse Agr2 or human AGR2 

25 muteins carrying the corresponding mutation to wild type mouse Agr2 or human 
AGR2 protein in the same assay, preferably side-by-side and under the same assay 
conditions, therefore resulting in relative values, it will be appreciated that the 
skilled person will be readily able to determine the above percentages of alteration 
in biological activity in the in vitro assays contemplated in connection with the 

30 present invention. 

Monitoring colon cell proliferation is one suitable assay to 
determine altered biological activity of a AGR2 mutein according to the present 
invention compared to wild type mouse Agr2 protein or human AGR2 protein. 
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One assay preferred in this regard is described herein in Example 20. In such a 
preferred assay, the incorporation of a label added to the culture medium into the 
cellular DNA of the cultured colon cells is monitored. The cultured cells are 
preferably mammalian colon cancer cell lines. Particularly preferred are the 
5 mammalian colon cancer cell lines LS174T or HT29. Cells are transfected with a 
wild type AGR2 expression vector (e.g., a vector expressing mouse Agr2 protein 
or human AGR2 protein according to SEQ ID NO:3 and SEQ ID NO:4, 
respectively), or with an expression vector expressing the AGR2 mutein of 
interest (i.e., expressing any of the novel AGR2 proteins or protein fragments 

10 described and claimed herein). Alternatively, AGR2 wild type protein (again 
preferably mouse Agr2 protein or human AGR2 protein according to SEQ ID 
NO:3 and SEQ ID NO:4, respectively) and the AGR2 mutein of interest (which 
may again be any of the novel AGR2 proteins or protein fragments described and 
claimed herein) may be added separately to the above cells in culture. In a 

15 preferred embodiment, the label used to monitor cell proliferation is a nucleoside 
analogue, for example, Bromodeoxyuridine (BrdU), which may be detected via 
anti-BrdU mouse monoclonal antibodies and subsequent immunofluorescence, 
immunohistochemical, ELISA or colorimetric methods. Alternatively, 
3[H]thymidine incorporation into the cellular DNA and subsequent liquid 

20 scintillation chromatography may be used. 

A further suitable in vitro assay to determine altered biological 
activity of a AGR2. mutein according to the present invention compared to wild 
type mouse Agr2 protein or human AGR2 protein is measuring goblet cell mucus 
secretion in culture. An assay preferred in this regard is described in Example 21. 

25 In such a preferred assay, mammalian goblet cells, and preferably mammalian 
colon cancer cell lines LS174T or HT29 are transfected with an AGR2 wild type 
expression vector (e.g., a vector expressing mouse Agr2 protein or human AGR2 
protein according to SEQ ID NO:3 and SEQ ID NO:4, respectively), or with an 
expression vector expressing the AGR2 mutein of interest (i.e., expressing any of 

30 the novel AGR2 proteins or protein fragments described and claimed herein). 
Alternatively, AGR2 wild type protein (e.g., mouse Agr2 protein or human AGR2 
protein according to SEQ ID NO:3 and SEQ ID NO:4, respectively) and the 
AGR2 mutein of interest (which may again be any of the novel AGR2 proteins or 
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protein fragments described and claimed herein) may be added separately to the 
above cells in culture. Subsequently, the cells are analyzed for changes in the 
expression of major mucin subtypes secreted by intestinal goblet cells, preferably 
for the expression of mucin2 (muc2). This can be done, for example, via RT-PCR 
5 (reverse transciption £olymerase chain reaction) using muc2-specific primers and 
mRNA from transfected and non-transfected or mock-transfected control cells, 
and subsequent quantitative PCR analysis. Alternatively, or in addition, the cells 
may be analyzed for changes in the expression of trefoil proteins, again, for 
example, via RT-PCR using trefoil-specific primers and mRNA from transfected 
10 and non-transfected or mock-transfected control cells and subsequent quantitative 
PCR analysis. 

Yet a further suitable in vitro assay to determine altered biological 
activity of an AGR2 mutein according to the present invention compared to wild 
type mouse Agr2 protein or human AGR2 protein is measuring Xenopus laevis 
cement gland differentiation, e.g., as described by Aberger et al. (Aberger et aL, 
1998). An assay preferred in this regard is described in Example 19. In such a 
preferred assay, the effect of expression or over-expression of wild type AGR2 
protein or AGR2 mutein upon the induction of ectopic cement gland 
differentiation and expression of anterior neural marker genes in Xenopus 
embryos is analyzed. In particular, vectors capable of expressing mRNA encoding 
wild type AGR2 protein (e.g., mouse Agr2 protein or human AGR2 protein 
according to SEQ ID NO:3 and SEQ ID NO:4, respectively), or mRNA encoding 
the AGR2 mutein of interest (i.e., encoding any of the novel AGR2 proteins or 
protein fragments described and claimed herein) are subjected to in vitro 
transcription, optionally followed by analyzing the quality of the RNA obtained 
via an in vitro translation system, e.g., reticulocyte lysate, and the capped mRNA 
thus obtained injected into early cleavage stage embryos of Xenopus laevis. 
Biological activity is subsequently analyzed by monitoring differentiation of 
mucin secreting cement glands. For example, biological activity is analyzed by 
monitoring cement gland enlargement or the presence of additional ectopic 
cement glands, as described in Aberger et al. 

A non-human vertebrate animal according to the present invention 
is furthermore one wherein the mutated allele contains a mutation which 
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corresponds to a mutation of the human AGR2 protein according to SEQ ID NO:4 
which is indicative of an increased risk of a human subject of developing a 
medical condition associated with an alteration in goblet cell function, or 
indicative of an association of a medical condition in a human subject which is 

5 associated with an alteration in goblet cell function with altered AGR2 expression 
or function. The term "corresponds to" again refers to the fact that the allele 
reflects the mutation in the way explained in more detail above. Mutations of the 
kind contemplated in this regard, and suitable methods of identifying them, are 
described in more detail below. 

10 In view of the fact that the present invention demonstrates for the 

first time that AGR2 is required for normal goblet cell function, and that mutating 
this gene and its gene product may result in goblet cell dysfunction and 
corresponding physiological and medical disorders of the affected animal, it will 
be apparent to the skilled artisan that other genes and their products which in turn 

15 . affect AGR2 gene expression or the function of the AGR2 protein will likewise 
affect goblet cell-related phenotypes and physiological and medical conditions. 
Accordingly, the present invention provides in a further aspect a non-human 
vertebrate animal comprising in the genome of at least some or all of its cells an 
allele of a gene coding for a protein which affects expression or function of the 

20 AGR2 protein of the animal, said allele comprising a mutation which, if present in 
the genome of all or essentially all cells of said animal in a homozygous manner, 
results in a phenotype associated with an alteration in goblet cell function 
compared to the corresponding wild-type animal. 

The gene referred to above in connection with the animal according 

25 to the invention is preferably an endogenous gene with respect to said animal. In 
preferred embodiments, the gene will encode a protein which is an orthologue of 
the AGR2 proteins defined by SEQ ID NO:3 and SEQ ID NO:4 with respect to 
said animal. The gene may, however, also be a heterologous gene with respect to 
said animal. For example, a mouse according to the present invention may be one 

30 wherein the endogenous mouse Agr2 gene has been replaced by a mutated human 
AGR2 gene, e.g., by an AGR2 gene encoding a protein according to SEQ ID 
NO:30. Likewise, a rat according to the present invention may be one wherein the 
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endogenous rat AGR2 gene has been replaced by a mutated mouse Agr2 gene, 
e.g., by an Agr2 gene encoding a protein according to SEQ ID NO:2. 

As will be apparent from the previous explanations, the non-human 
vertebrate animals according to the invention may also be transgenic animals, i.e., 

5 the mutated allele of the gene may represent DNA that is heterologous with 
respect to the genomic DNA of said animal, or it may be mutated by virtue of the 
insertion of DNA that is heterologous with respect to the genomic DNA of said 
animal. Heterologous DNA may be inserted, for example, by the method of 
targeting vector-mediated homologous recombination at the Agr-2 genomic DNA 

10 locus in mouse embryonic stem cells, resulting in a replacement of the 
endogenous Agr-2 allele by heterologous DNA, as will be appreciated by those 
skilled in the art. Transgenic animals may then be generated by subsequent 
breeding. 

The endogenous promoter of the AGR2 gene or the gene affecting 
15 its expression or function may be replaced by a heterologous promoter, e.g., a 
promoter imposing a different tissue specificity of expression upon the gene, or a 
promoter that is inducible by chemical or physical means. 

The non-human vertebrate animal according to the invention may 
also be a "knock-out" animal with respect to the AGR2 gene or the gene affecting 
20 expression or function of the AGR2 protein. In these animals, the above- 
mentioned mutation results in the reduction or complete abolishment of 
expression of said gene. 

The mutated allele may be present in the germ cells or the somatic 
cells of the non-human vertebrate animal, or both. In a preferred embodiment, the 
25 genome of said cells is homozygous with respect to said allele. 

The present invention further provides for inbred successive lines 
of animals carrying the mutant AGR2 nucleic acid of the present invention that 
offer the advantage of providing a virtually homogenous genetic background. A 
genetically homogenous line of animals provides a functionally reproducible 
30 model system for disorders or symptoms associated with alterations in goblet cell 
function and mucosal epithelium. 
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In a particularly preferred embodiment the non-human vertebrate 
animal according to the invention expresses in at least some of its cells, preferably 
the goblet cells, a polypeptide as shown in SEQ ID NO:2 or SEQ ID NO:30. 

The animals of the invention can be produced by using any 
technique known to the person skilled in the art; including but not limited to 
micro-injection, electroporation, cell gun, cell fusion, micro-injection into 
embryos of teratocarcinoma stem cells or functionally equivalent embryonic stem 
cells. The animals of the present invention may be produced by the application of 
procedures, which result in an animal with a genome that incorporates/integrates 
exogenous genetic material in such a manner as to modify or disrupt the function 
of the normal AGR2 gene or protein. A preferred procedure for generating an 
animal of this invention is one according to Example 1. 

Alternatively, the procedure may involve obtaining genetic 
material, or a portion thereof, which encodes a wild type AGR2 protein, as 
described in Example 5. The isolated native sequence is then genetically 
manipulated by the insertion of any of the mutations described and claimed in 
accordance with the present invention, e.g., a mutation appropriate to replace, e.g., 
the residue at position 137 of the amino acid sequence shown in SEQ ID NO:3 or 
SEQ ID NO:4. The manipulated construct may then be inserted into embryonic 
stem cells, e.g., by electroporation. The cells subjected to the procedure are 
screened to find positive cells, i.e., cells, which have integrated into their genome 
the desired construct encoding an altered AGR2. The positive cells may be 
isolated, cloned (or expanded) and injected into blastocysts obtained from a host 
animal of the same species or a different species. For example, positive cells are 
injected into blastocysts from mice, the blastocysts are then transferred into a 
female host animal and allowed to grow to term, following which the offspring of 
the female are tested to determine which animals are transgenic,* i.e., which 
animals have an inserted exogenous mutated DNA sequence. One suitable method 
involves the introduction of the recombinant gene at the fertilized oocyte stage 
ensuring that the gene sequence will be present in all of the germ cells and 
somatic cells of the "founder" animal. The term "founder animal" as used herein 
means the animal into which the recombinant gene was introduced at the one cell 
embryo stage. 
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The animals of the invention can also be used as a source of 
primary cells from a variety of tissues, for cell culture experiment, including, but 
not limited to, the production of immortalized cell lines by any methods known in 
the art, such as retroviral transformation. Such primary cells or immortalized cell 

5 lines derived from any one of the non-human vertebrate animals described and 
claimed herein are likewise within the scope of the present invention. Such 
immortalized cells from these animals may advantageously exhibit desirable 
properties of both normal and transformed cultured cells, i.e., they will be normal 
or nearly normal morphologically and physiologically, but can be cultured for 

10 long, and perhaps indefinite periods of time. The primary cells or cell lines 
derived thereof may furthermore be used for the construction of an animal model 
according to the present invention. 

In other embodiments cell lines according to the present invention 
may be prepared by the insertion of a nucleic acid construct comprising the 

15 nucleic acid sequence of the invention or a fragment thereof comprising the codon 
imparting the above-described phenotype to the animal model of the invention. 
Suitable cells for the insertion include primary cells harvested from an animal as 
well as cells, which are members of an immortalized cell line. Recombinant 
nucleic acid constructs of the invention, described below, may be introduced into 

20 the cells by any method known in the art, including but not limited to, 
transfection, retroviral infection, micro-injection, electroporation, transduction or 
DEAE-dextran. Cells, which express the recombinant construct, may be identified 
by, for example, using a second recombinant nucleic acid construct comprising a 
reporter gene, which is used to produce selective expression. Cells that express the 

25 nucleic acid sequence of the invention or a fragment thereof may be identified 
indirectly by the detection of reporter gene expression. 

It will be appreciated that the non-human vertebrate animals of the 
invention are useful in various respects in connection with goblet cell function or 
dysfunction and goblet cell-related phenotypes and medical conditions. 

30 Accordingly, one aspect of the present invention is the use of the 

non-human vertebrate animal for the identification of a protein or nucleic acid 
diagnostic marker for a goblet cell-related disorder. Also within the scope of the 
present invention is the use of the animal as a model for studying the molecular 
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mechanisms of, or physiological processes associated with, a goblet cell-related 
disorder. 

Furthermore, the non-human vertebrate animal of the present 
invention may be used for the identification and testing of agents useful in the 

5 prevention, amelioration, or treatment of a goblet cell-related disorder. Such 
goblet cell-related disorders are in particular asthma, chronic obstructive 
pulmonary disease (COPD), cystic fibrosis, dry eye syndrome, gastric disease, 
peptic ulcer, inflammatory bowel disease (in particular Crohn's disease or 
ulcerative colitis), and intestinal cancer. 

10 Further uses of the non-human vertebrate animals described herein 

which form additional aspects of the present invention are those relating to 
studying the molecular mechanisms of, or physiological processes associated 
with, conditions associated with, or affected by, reduced activity or undesirable, 
e.g., increased, activity of endogenous AGR2. Likewise, conditions associated 

15 with reduced expression, reduced production or undesirable, e.g., increased 
production of endogenous AGR2 may be analyzed. 

It will also be appreciated that the non-human vertebrate animals 
described herein will be highly useful as a model system for the screening, 
identification and testing of agents useful in the prevention, amelioration, or 

20 treatment of the above-mentioned conditions. Such agents may be, for example, 
small molecule drugs, peptides or polypeptide, or nucleic acids. For the purposes 
of the present invention, small molecule drugs preferably have a molecular weight 
of no more than 2,000 Dalton, more preferably no more than 1500 Dalton, even 
more preferably no more than 1000 Dalton, and most preferably no more than 

25 500, 400, 300 or even 200 Dalton. Such agents may alter the biological activity of 
the wild type AGR2 or the AGR2 mutein, i.e., these agents may act on both types 
of proteins as agonist or antagonist. 

It will furthermore be apparent from the above that the non-human 
vertebrate animals described herein will be highly useful for identifying protein or 

30 nucleic acid diagnostic markers, such as diagnostic markers relating to genes or 
gene products that play a role in the early phase, the intermediate phase, and/or the 
late phase of medical conditions associated with an alteration in goblet cell 
function, e.g., for diseases associated with wild type AGR2 or AGR2 mutein 
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deficiency or over-expression. It will be appreciated that such diagnostic markers 
may relate to the AGR2 gene or its protein product However, it will be 
appreciated that the non-human vertebrate animal according to the present 
invention can also be used to identify markers relating to other genes or gene 
5 products that affect AGR2 gene or protein expression or function, or the 
expression or function of which is affected by the AGR2 protein. Moreover, since 
the non-human vertebrate animal of the invention represents a highly useful model 
system for studying the pathogenesis of medical conditions associated with an 
alteration in goblet cell function, it will be appreciated that it may also be used to 
10 identify disease-relevant markers relating to genes or gene products that do not 
directly affect AGR2 gene or protein expression or function, or the expression or 
function of which is not directly affected by the AGR2 protein. It will be 
appreciated that the above-mentioned uses represent further aspects of the present 
invention. 

Finally, it will be appreciated from the above that the non-human 
vertebrate animals described herein will be highly useful for identifying receptors 
of the AGR2 protein, or upstream or downstream genes or proteins regulated by 
the AGR2 protein or gene activity, and deregulated in disorders associated with 
AGR2 deficiency or over-expression. 

Nucleic Acids 

The present invention furthermore provides nucleic acid sequences 
encoding the AGR2 muteins as described in more detail below, for example 
murine and human AGR2 mutated in accordance with the present invention. In a 
preferred embodiment, this invention provides a mutated nucleic acid sequence 
for murine AGR2 (SEQ ID NO:l). Furthermore, this invention provides a mutated 
nucleic acid sequence of human AGR2 (SEQ ID NO:29). Mutated human AGR2 
genes can be made, for example, by altering codon 137 of the wild type human 
AGR2 gene (SEQ ID NO:5), such that codon 137 no longer encodes valin. The 
construction of a gene with a 137 th codon that does not encode valin is well 
known. Valin is encoded by GTT, GTC, GTA and GTG. A codon that does not 
encode valin may be, for example, a codon that encodes Phe (TTT, TTC); Leu 
(TTA, TTG, CTT, CTC, CTA, CTG); He (ATT, ATC, ATA); Met (ATG); Asp 
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(GAC, GAT); Ser (TCT, TCC, TCA, TCG), Pro (CCT, CCC, CCA, CCG); Thr 
(ACT, ACQ ACA, ACG), Ala (GCT, GCC, GCA, GCG); Tyr (TAT, TAC); His 
(CAT, CAC), Gin (CAA, CAG); Asn (AAT, AAC); Lys (AAA, AAG); Glu 
(GAA, GAG); Cys (TGT, TGC); Tip (TGG); Arg (CGT, CGC, CGA, CGG, 
5 AGA, AGG); Ser (AGT, AGC); Gly (GGT, GGC, GGA, GGG) or one of the stop 
codons (TAA, TAG, TGA). Methods for the introduction of site-specific nucleic 
acid mutations are well known. 

The nucleic acid sequences encoding mutant AGR2 of the 
invention may exist alone or in combination with other nucleic acids as, for 
10 example, vector molecules, such as plasmids, including expression or cloning 
vectors. 

The term "nucleic acid sequence 5 ' as used herein refers to any 
contiguous sequence series of nucleotide bases, i.e., a polynucleotide, and is 
preferably a ribonucleic acid (RNA) or deoxy-ribonucleic acid (DNA). Preferably 

15 the nucleic acid sequence is cDNA. It may, however, also be, for example, a 
peptide nucleic acid (PNA). 

An "isolated" nucleic acid molecule, as referred to herein, is one, 
which is separated from other nucleic acid molecules ordinarily present in the 
natural source of the nucleic acid. Preferably, an "isolated" nucleic acid is free of 

20 sequences, which naturally flank the nucleic acid (i.e., sequences located at the 5»- 
and 3'-termini of the nucleic acid) in the genomic DNA of the organism that is the 
natural (wild type) source of the DNA. 

AGR2 gene molecules can be isolated using standard hybridization 
and cloning techniques, as described, for instance, in Sambrook et al. (eds.), 

25 Molecular Cloning: A Laboratory Manual (2 nd Ed.), Cold Spring. Harbor 
Laboratory Press, Cold Spring Harbor, NY, 1989; and Ausubel et al. (eds.), 
Current Protocols in Molecular Biology, John Wiley & Sons, New York, 
NY, 1993. 

A nucleic acid of the invention can be amplified using cDNA, 
30 mRNA or, alternatively, genomic DNA, as a template and appropriate 
oligonucleotide primers according to standard PCR amplification techniques. The 
nucleic acid so amplified can be cloned into an appropriate vector and 
characterized by DNA sequence analysis. Furthermore, oligonucleotides 
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corresponding to AGR2 nucleotide sequences can be prepared by standard 
synthetic techniques, e.g., using an automated DNA synthesizer. 

As used herein, the term "oligonucleotide" refers to a series of 
linked nucleotide residues, which oligonucleotide has a sufficient number of 
5 nucleotide bases to be used in a PCR reaction. A short oligonucleotide sequence 
may be based on, or designed from, a genomic or cDNA sequence and is used to 
amplify, confirm, or reveal the presence of an identical, similar or complementary 
DNA or RNA in a particular cell or tissue. Generally, the term "oligonucleotide" 
is used to refer to a series of contiguous nucleotides (a polynucleotide) of about 
10 100 nucleotides (nt) or less, e.g., portions of a nucleic acid sequence of about 100 
nt, 50 nt, or 20 nt in length, preferably nucleotide sequences of about 15 nt to 30 
nt in length. 

As used herein, the term "complementary" refers to Watson-Crick 
or Hoogsteen base pairing between nucleotide units of a nucleic acid molecule, 

15 and the term "binding" means the physical or chemical interaction between two 
polypeptides or compounds or associated polypeptides or compounds or 
combinations thereof. 

A "homologous nucleic acid sequence" or "homologous amino 
acid sequence," or variations thereof, refers to sequences characterized by a 

20 homology at the nucleotide level or amino acid level, respectively. Homologous 
nucleotide sequences can include those sequences coding for isoforms of AGR2 
polypeptides. Isoforms can be expressed in different tissues of the same organism 
as a result of, for example, alternative splicing of RNA. Alternatively, isoforms 
can be encoded by different genes. 

25 As used herein, the phrase "stringent hybridization conditions" 

refers to conditions under which a probe, primer or oligonucleotide or any other 
nucleic acid sequence referred to herein will hybridize to its target sequence, but 
to no other sequences. Stringent conditions are sequence-dependent and will be 
different in different circumstances. Longer sequences hybridize specifically at 

30 higher temperatures than shorter sequences. Generally, stringent conditions are 
selected to be about 5°C lower than the thermal melting point (Tm) for the 
specific sequence at a defined ionic strength and pH. The Tm is the temperature 
(under defined ionic strength, pH and nucleic acid concentration) at which 50% of 
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the probes complementary to the target sequence hybridize to the target sequence 
at equilibrium. Since the target sequences are generally present at excess, at Tm, 
50% of the probes are occupied at equilibrium. Typically, stringent conditions will 
be those in which the salt concentration is less than about 1.0 M sodium ion, 
5 typically about 0.01 to 1.0 M sodium ion (or other salts) at pH 7.0 to 8.3, and the 
temperature is at least about 30°C for short probes, primers or oligonucleotides 
(e.g., 10 nt to 50 nt) and at least about 60°C for longer probes, primers and 
oligonucleotides. Stringent conditions may also be achieved with the addition of 
destabilizing agents, such as formamide. Stringent conditions are known to those 

10 skilled in the art and can be found in Ausubel et al. (eds.), Current Protocols 
in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. 

Preferred stringent hybridization conditions in accordance with the 
nucleic acids of the present invention, for example the antisense nucleic acids 
described further below, are hybridization in a high salt buffer comprising 6x 

15 SSC, 50 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% 
BSA, and 500 mg/ml denatured salmon sperm DNA at 65 °C, followed by one or 
more washes in 0.2x SSC, 0.01% BSA at 50°C. 

As used herein, for example, in connection with the antisense 
nucleic acids of the present invention described further below, the phrase 

20 "hybridization under physiological conditions" refers to hybridization of a probe, 
primer or oligonucleotide, or any other nucleic acid sequence to its target 
sequence under conditions as they are found inside eukaryotic cells either within a 
multicellular organism or under conditions of cell or tissue culture. Such 
conditions are preferably characterized by a temperature of about or exactly 37°C, 

25 absence of formamide, and an ionic strength corresponding to physiological 
buffer. 

Antisense Nucleic Acids 

A preferred nucleic acid according to the present invention is an 
30 antisense nucleic acid comprising a nucleotide sequence which is complementary 
to a part of an mRNA encoding a mutein according to the present invention, said 
part encoding an amino acid sequence comprising the amino acid or amino acid 
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sequence which corresponds to the mutation described in more detail in 
connection with said muteins. 

A further preferred antisense nucleic acid is one comprising a 
nucleotide sequence which is complementary to a part of an mRNA encoding the 
5 mouse Agr2 or the human AGR2 protein according to SEQ ID NO:3 and SEQ ID 
NO:4, respectively, or an orthologue thereof having at least 65%, 70%, 75%, 80%, 
85%, 90%, 95%, 98%, or 99% amino acid identity compared to the mouse Agr2 
or the human AGR2 protein as defined above, said part being a non-coding part 
and comprising a sequence corresponding to a mutation in the gene coding for 

10 said protein or orthologue which affects expression of said protein or orthologue. 

Yet a further preferred antisense nucleic acid is one comprising a 
nucleotide sequence which is complementary to a part of an mRNA encoding a 
protein which affects expression or function of the mouse Agr2 or the human 
AGR2 protein according to SEQ ID NO:3 and SEQ ID NO:4, respectively, or an 

15 orthologue thereof having at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, 
or 99% amino acid identity compared to the mouse Agr2 or the human AGR2 
protein according to SEQ ID NO:3 and SEQ ID NO:4, respectively. 

In a preferred embodiment, the antisense nucleic acid is capable of 
hybridizing to the mRNA via the complementary nucleotide sequence under 

20 physiological conditions, in particular the preferred physiological conditions 
defined above. In this case, the antisense RNA is inter alia suitable to be used in 
connection with the methods and uses of the present invention that relate to the 
prevention, treatment, or amelioration of a medical condition associated with an 
alteration in goblet cell function. In another preferred embodiment, the antisense 

25 RNA according to the present invention is capable of hybridizing to said mRNA 
under high stringency conditions, in particular the preferred high stringency 
conditions defined above. 

The antisense nucleic acid may be a ribozyme comprising a 
catalytic region; suitably, the catalytic regiion enables the antisense RNA to 

30 specifically cleave the mRNA to which the antisense RNA hybridizes. 

It may be advantageous that the antisense nucleic acid of the 
invention hybridizes more effectively to its target mRNA than to an mRNA 
encoding the same protein which, however, corresponds to the wild-type mouse 
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Agr2 or human AGR2 protein according to SEQ ID NO:3 and SEQ ID NO:4 in 
respect of the mutated amino acid sequence. Also preferred are antisense nucleic 
acids which hybridize more effectively to their target mRNA than to the mRNA 
encoded by the wild-type genes encoding the mouse Agr2 protein or the human 

5 AGR2 protein according to SEQ ID NO:3 and SEQ ID NO;4, respectively, or the 
wild-type gene encoding the corresponding orthologue. Preferred are in addition 
antisense nucleic acids which hybridize more effectively to their target mRNA 
than to the mRNA encoded by the wild-type gene of the corresponding protein 
which affects expression or function of the mouse Agr2 or the human AGR2 

10 protein according to SEQ ID NO:3 and SEQ ID NO:4, respectively. 

Prokaryotic and eukaryotic host cells transformed with the above 
antisense nucleic acids are likewise within the scope of the present invention. 

Aptamers 

15 Aptamers are macromolecules composed of nucleic acid, such as 

RNA or DNA, that tightly bind to protein. The present invention provides 
aptamers specifically binding to the proteins described herein. Preferably, the 
specificity of the aptamers is sufficient so that they do not, or substantially do not, 
bind to any other protein in the cell. Preferred aptamers bind to the AGR2 muteins 

20 of the present invention, or a portion thereof comprising a mutation as described 
herein, e.g., a substitution of amino acid 137. Another preferred aptamer binds to 
the wild type AGR2 protein or a portion thereof. The aptamers of the present 
invention preferably bind their ligands with high specificity and affinity in the 
nanomolar range, e.g., in the low nanomolar range with K(D) values ranging 

25 between 12 nM and 130 nM. 

Interfering RNA 

In one aspect of the invention, AGR2 gene expression can be 
attenuated by RNA interference. One approach well-known in the art is short 
30 interfering RNA (siRNA) mediated gene silencing where expression products of a 
AGR2 gene are targeted by specific double stranded AGR2 derived siRNA 
nucleotide sequences that are complementary to at least a 19-25 nt long segment 
of the AGR2 gene transcript, including the 5' untranslated (UT) region, the open 
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reading frame (ORF), or the 3 s UT region. See, for example, PCT applications 
WO00/44895, W099/32619, WO01/75164, WO01/92513, WO01/29058, 
WO01/89304, WO02/16620, and WO02/29858, each incorporated by reference 
herein in their entirety. Targeted genes can be an AGR2 gene, or an upstream or 
5 downstream modulator of AGR2 gene expression or protein activity. For example, 
expression of a phosphatase or kinase of AGR2 may be targeted by an siRNA. 

According to the methods of the present invention, AGR2 gene 
expression is silenced using short interfering RNA. An AGR2 polynucleotide 
according to the invention includes an siRNA polynucleotide. Such an AGR2 
10 siRNA can be obtained using an AGR2 polynucleotide sequence, for example, by 
processing the AGR2 ribopolynucleotide sequence in a cell-free system, such as 
but not limited to a Drosophila extract, or by transcription of recombinant double 
stranded AGR2 RNA or by chemical synthesis of nucleotide sequences 
homologous to a AGR2 sequence. See, e.g., Tuschl, Zamore, Lehmann, Bartel and 
15 Sharp (1999), Genes & Dev. 13: 3191-3197, incorporated herein by reference in 
its entirety (Tuschl et al., 1999). When synthesized, a typical 0.2 micromolar-scale 
RNA synthesis provides about 1 milligram of siRNA, which is sufficient for 1000 
transfection experiments using a 24-well tissue culture plate format. 

The most efficient silencing is generally observed with siRNA 
20 duplexes composed of a 21-nt sense strand and a 21-nt antisense strand, paired in 
a manner to have a 2-nt 3' overhang. The sequence of the 2-nt 3' overhang makes 
an additional small contribution to the specificity of siRNA target recognition. 
The contribution to specificity is localized to the unpaired nucleotide adjacent to 
the first paired bases. In one embodiment, the nucleotides in the 3' overhang are 
25 ribonucleotides. In an alternative embodiment, the nucleotides in the 3' overhang 
are deoxyribonucleotides. Using 2 , -deoxynucleotides in the 3' overhangs is as 
efficient as using ribonucleotides, but deoxyribonucleotides are often cheaper to 
synthesize and are most likely more nuclease resistant. 

A recombinant expression vector of the invention comprises a 
30 AGR2 DNA molecule cloned into an expression vector comprising operatively- 
linked regulatory sequences flanking the AGR2 sequence in a manner that allows 
for expression (by transcription of the DNA molecule) of both strands. An RNA 
molecule that is antisense to AGR2 mRNA is transcribed by a first promoter (e.g., 
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a promoter sequence 3' of the cloned DNA) and an RNA molecule that is the 
sense strand for the AGR2 mRNA is transcribed by a second promoter (e.g., a 
promoter sequence 5' of the cloned DNA). The sense and antisense strands may 
hybridize in vivo to generate siRNA constructs for silencing of the AGR2 gene. 
5 Alternatively, two constructs can be utilized to create the sense and anti-sense 
strands of an siRNA construct. Finally, cloned DNA can encode a construct 
having secondary structure, wherein a single transcript has both the sense and 
complementary antisense sequences from the target gene or genes. In an example 
of this embodiment, a hairpin RNAi product is homologous to all or a portion of 

10 the target gene. In another example, a hairpin RNAi product is an siRNA. The 
regulatory sequences flanking the AGR2 sequence may be identical or may be 
different, such that their expression may be modulated independently, or in a 
temporal or spatial manner. 

In a specific embodiment, siRNAs are transcribed intracellularly by 

15 cloning the AGR2 gene templates into a vector containing, e.g., a RNA pol III 
transcription unit from the smaller nuclear RNA (snRNA) U6 or the human RNase 
P RNA HI. One example of a vector system is the GeneSuppressor™ RNA 
Interference kit (commercially available from Imgenex). The U6 and HI 
promoters are members of the type III class of Pol III promoters. The +1 

20 nucleotide of the U6-like promoters is always guanosine, whereas the +1 for HI 
promoters is adenosine. The termination signal for these promoters is defined by 
five consecutive thymidines. The transcript is typically cleaved after the second 
uridine. Cleavage at this position generates a 3' UU overhang in the expressed 
siRNA, which is similar to the 3* overhangs of synthetic siRNAs. Any sequence 

25 less than 400 nucleotides in length can be transcribed by these promoter, therefore 
they are ideally suited for the expression of around 21-nucleotide siRNAs in, e.g., 
an approximately 50-nucleotide RNA stem-loop transcript. 

siRNA vectors appear to have an advantage over synthetic siRNAs 
where long term knock-down of expression is desired. Cells transfected with a 

30 siRNA expression vector would experience steady, long-term mRNA inhibition. 
In contrast, cells transfected with exogenous synthetic siRNAs typically recover 
from mRNA suppression within seven days or ten rounds of cell division. The 
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long-term gene silencing ability of siRNA expression vectors may provide for 
applications in gene therapy. 

In general, siRNAs are chopped from longer dsRNA by an ATP- 
dependent ribonuclease called DICER. DICER is a member of the RNase HI 
5 family of double-stranded RNA-specific endonucleases. The siRNAs assemble 
with cellular proteins into an endonuclease complex. In vitro studies in Drosophila 
suggest that the siRNAs/protein complex (siRNP) is then transferred to a second 
enzyme complex, called an RNA-induced silencing complex (RISC), which 
contains an endoribonuclease that is distinct from DICER. RISC uses the 
10 sequence encoded by the antisense siRNA strand to find and destroy mRNAs of 
complementary sequence. The siRNA thus acts as a guide, restricting the 
ribonuclease to cleave only mRNAs complementary to one of the two siRNA 
strands. 

An AGR2 mRNA region to be targeted by siRNA is generally 
selected from a desired AGR2 sequence beginning 50 tolOO nt downstream of the 
start codon. Alternatively, 5' or 3' UTRs and regions nearby the start codon can be 
used but are generally avoided, as these may be richer in regulatory protein 
binding sites. UTR-binding proteins and/or translation initiation complexes may 
interfere with binding of the siRNP or RISC endonuclease complex. An initial 
BLAST homology search for the selected siRNA sequence is done against an 
available nucleotide sequence library to ensure that only one gene is targeted. 
Specificity of target recognition by siRNA duplexes indicate that a single point 
mutation located in the paired region of an siRNA duplex is sufficient to abolish 
target mRNA degradation. See Elbashir et al. 2001 EMBO J. 20(23):6877-88 
(Elbashir et al., 2001b). Hence, consideration should be taken to accommodate 
SNPs, polymorphisms, allelic variants or species-specific variations when 
targeting a desired gene. 

A complete AGR2 siRNA experiment should include the proper 
negative control. Negative control siRNA should have the same nucleotide 
composition as the AGR2 siRNA but lack significant sequence homology to the 
genome. Typically, one would scramble the nucleotide sequence of the AGR2 
siRNA and do a homology search to make sure it lacks homology to any other 
gene. 
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Two independent AGR2 siRNA duplexes can be used to knock- 
down a target AGR2 gene. This helps to control for specificity of the silencing 
effect. In addition, expression of two independent genes can be simultaneously 
knocked down by using equal concentrations of different AGR2 siRNA duplexes. 
5 Availability of siRNA-associating proteins is believed to be more limiting than 
target mRNA accessibility. 

A targeted AGR2 region is typically a sequence of two adenines 
(AA) and two thymidines (TT) divided by a spacer region of nineteen (N19) 
residues (e.g., AA(N19)TT). A desirable spacer region has a G/C-content of 
10 approximately 30% to 70%, and more preferably of about 50%. If the sequence 
AA(N19)TT is not present in the target sequence, an alternative target region 
would be AA(N21). The sequence of the AGR2 sense siRNA corresponds to 
(N19)TT or N21, respectively. In the latter case, conversion of the 3' end of the 
sense siRNA to TT can be performed if such a sequence does not naturally occur 
15 in the AGR2 polynucleotide. The rationale for this sequence conversion is to 
generate a symmetric duplex with respect to the sequence composition of the 
sense and antisense 3' overhangs. Symmetric 3' overhangs may help to ensure that 
the siRNPs are formed with approximately equal ratios of sense and antisense 
target RNA-cleaving siRNPs (see, Elbashir, Lendeckel and Tuschl (2001), Genes 
20 & Dev. 15: 188-200, incorporated by reference herein in its entirely) (Elbashir et 
al., 2001a). The modification of the overhang of the sense sequence of the siRNA 
duplex is not expected to affect targeted mRNA recognition, as the antisense 
siRNA strand guides target recognition. 

Alternatively, if the AGR2 target mRNA does not contain a 
25 suitable AA(N21) sequence, one may search for the sequence NA(N21). Further, 
the sequence of the sense strand and antisense strand may still be synthesized as 5' 
(N19)TT, as it is believed that the sequence of the 3 -most nucleotide of the 
antisense siRNA does not contribute to specificity. Unlike antisense or ribozyme 
technology, the secondary structure of the target mRNA does not appear to have a 
strong effect on silencing. See Harborth et al. (2001) J. Cell Science 114: 4557- 
4565, incorporated herein by reference in its entirety (Harborth et al., 2001). 

Transfection of AGR2 siRNA duplexes can be achieved using 
standard nucleic acid transfection methods, for example, OLIGOFECTAMINE 
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Reagent (commercially available from Invitrogen). An assay for AGR2 gene 
silencing is generally performed approximately 2 days after transfection. No- 
AGR2 gene silencing has been observed in the absence of transfection reagent, 
allowing for a comparative analysis of the wild type and silenced AGR2 
5 phenotypes. In a specific embodiment, for one well of a 24-well plate, 
approximately 0.84 \ig of the siRNA duplex is generally sufficient Cells are 
typically seeded the previous day, and are transfected at about 50% confluence. 
The choice of cell culture media and conditions are routine to those of skill in the 
art, and will vary with the choice of cell type. The efficiency of transfection may 
10 depend on the cell type, but also on the passage number and the confluency of the 
cells. The time and the manner of formation of siRNA-liposome complexes (e.g. 
inversion versus vortexing) are also critical. Low transfection efficiencies are the 
most frequent cause of unsuccessful AGR2 silencing. The efficiency of 
transfection needs to be carefully examined for each new cell line to be used. 
15 Preferred cells are derived from a mammal, more preferably from a rodent such as 
a rat or mouse, and most preferably from a human. Where used for therapeutic 
treatment, the cells are preferentially autologous, although non-autologous cell 
sources are also contemplated as within the scope of the present invention. 

For a control experiment, transfection of 0.84 \ig single-stranded 
20 sense AGR2 siRNA will have no effect on AGR2 silencing, and 0.84 fig antisense 
siRNA has a weak silencing effect when compared to 0.84 \ig of duplex siRNAs. 
Control experiments again allow for a comparative analysis of the wild type and 
silenced AGR2 phenotypes. To control for transfection efficiency, targeting of 
common proteins is typically performed, for example targeting of lamin A/C or 
25 transfection of a CMV-driven EGFP-expression plasmid (e.g. commercially 
available from Clontech). In the above example, a determination of the fraction of 
lamin A/C knockdown in cells is determined the next day by such techniques as 
immunofluorescence, Western blot, Northern blot or other similar assays for 
protein expression or gene expression. Lamin A/C monoclonal antibodies may be 
30 obtained from Santa Cruz Biotechnology. 

Depending on the abundance and the half life (or turnover) of the 
targeted AGR2 polynucleotide in a cell, a knock-down phenotype may become 
apparent after 1 to 3 days, or even later. In cases where no AGR2 knock-down 
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phenotype is observed, depletion of the AGR2 polynucleotide may be observed by 
immunofluorescence or Western blotting. If the AGR2 polynucleotide is still 
abundant after 3 days, cells need to be split and transferred to a fresh 24-well plate 
for re-transfection. If no knock-down of the targeted protein (AGR2 or a AGR2 
5 upstream or downstream gene) is observed, it may be desirable to analyze whether 
the target mRNA was effectively destroyed by the transfected siRNA duplex. Two 
days after transfection, total RNA is prepared, reverse transcribed using a target- 
specific primer, and PCR-amplified with a primer pair covering at least one exon- 
exon junction in order to control for amplification of pre-mRNAs. RT/PCR of a 

10 non-targeted mRNA is also needed as control Effective depletion of the mRNA 
yet undetectable reduction of target protein may indicate that a large reservoir of 
stable AGR2 protein may exist in the cell. Multiple transfection in sufficiently 
long intervals may be necessary until the target protein is finally depleted to a 
point where a phenotype may become apparent. If multiple transfection steps are 

15 required, cells are split 2 to 3 days after transfection. The cells may be transfected 
immediately after splitting. 

An inventive therapeutic method of the invention contemplates 
administering an AGR2 siRNA construct as therapy to compensate for increased 
or aberrant AGR2 expression or activity. The AGR2 ribopolynucleotide is 

20 obtained and processed into siRNA fragments as described. The AGR2 siRNA is 
administered to cells or tissues using known nucleic acid transfection techniques, 
as described above. An AGR2 siRNA specific for an AGR2 gene will decrease or 
knockdown AGR2 transcription products, which will lead to reduced AGR2 
polypeptide production, resulting in reduced AGR2 polypeptide activity in the 

25 cells or tissues. 

Particularly preferred in connection with the present invention are 
siRNAs comprising a double stranded nucleotide sequence wherein one strand is 
complementary to an at least 19, 20, 21, 22, 23, 24, or 25 nucleotide long segment 
of an mRNA encoding a mutein of the invention as described herein, said segment 
30 encoding an amino acid sequence comprising the amino acid or amino acid 
sequence which corresponds to any of the mutations defined previously in 
connection with these muteins. 
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Also preferred are siRNAs wherein said strand is complementary to 
an at least 19, 20, 21, 22, 23, 24, or 25 nucleotide long segment of an mRNA 
encoding the mouse Agr2 or the human AGR2 protein according to SEQ ID NO:3 
and SEQ ID NO:4, respectively, or an orthologue thereof having or at least 65%, 
5 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99% amino acid identity compared to 
the mouse Agr2 or the human AGR2 protein as defined above, said segment being 
a non-coding segment and comprising a sequence corresponding to a mutation in 
the gene coding for said protein or orthologue which affects expression of said 
protein or orthologue. 

10 Furthermore preferred are siRNAs wherein said strand is 

complementary to an at least 19, 20, 21, 22, 23, 24, or 25 nucleotide long segment 
of an mRNA encoding a protein which affects expression or function of the mouse 
Agr2 or the human AGR2 protein according to SEQ ID NO:3 and SEQ ID NO:4, 
respectively, or an orthologue thereof having at least 65%, 70%, 75%, 80%, 85%, 
90%, 95%, 98%, or 99% amino acid identity compared to the mouse Agr2 or the 
human AGR2 protein according to SEQ ID NO:3 and SEQ ID NO:4, respectively. 

The above-mentioned segment may include sequences from the 5' 
untranslated (UT) region. Alternatively, or in addition, it may include sequences 
corresponding to the open reading frame (ORF). Again alternatively or in 
addition, it may include sequences from the 3' untranslated (UT) region. 

Prokaryotic and eukaryotic host cells transformed with the above 
siRNAs are likewise within the scope of the present invention. 

The present invention also encompasses a method of treating a 
disease or condition associated with the presence of an AGR2 protein in an 
individual comprising administering to the individual an RNAi construct that 
targets the mRNA of the protein (the mRNA that encodes the protein) for 
degradation. A specific RNAi construct includes a siRNA or a double stranded 
gene transcript that is processed into siRNAs. Upon treatment, the target protein is 
not produced or is not produced to the extent it would be in the absence of the 
treatment 

Where the AGR2 gene function is not correlated with a known 
phenotype, a control sample of cells or tissues from healthy individuals provides a 
reference standard for determining AGR2 expression levels. Expression levels are 
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detected using the assays described, e.g., RT-PCR, Northern blotting, Western 
blotting, ELISA, and the like. A subject sample of cells or tissues is taken from a 
mammal, preferably a human subject, suffering from a disease state. The AGR2 
ribopolynucleotide is used to produce siRNA constructs, that are specific for the 

5 AGR2 gene product. These cells or tissues are treated by administering AGR2 
siRNAs to the cells or tissues by methods described for the transfection of nucleic 
acids into a cell or tissue, and a change in AGR2 polypeptide or polynucleotide 
expression is observed in the subject sample relative to the control sample, using 
the assays described. This AGR2 gene knockdown approach provides a rapid 

10 method for determination of a AGR2-phenotype in the treated subject sample. The 
AGR2-phenotype observed in the treated subject sample thus serves as a marker 
for monitoring the course of a disease state during treatment 



Proteins and Amino Acids 

15 The present invention also provides, for example, murine and 

human mutated AGR2 amino acid sequences (muteins). The wild type murine and 
human amino acid sequences are shown in SEQ ID NO:3 and SEQ ID NO:4 
respectively. A mutated version of the mouse amino acid sequence wherein valin 
at position 137 is mutated to a glutamic acid is exemplified in SEQ ID NO:2. A 

20 mutated version of the human amino acid sequence wherein valin at position 137 
is mutated to a glutamic acid is exemplified in SEQ ID NO:30. 

More generally, the present invention provides a protein having at least 
65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 
95%, at least 98%, or at least 99% amino acid identity compared to the mouse 

25 Agr2 or the human AGR2 protein according to SEQ ID NO:3 and SEQ ID NO:4, 
respectively. Also encompassed by the present invention are fragments of such 
proteins comprising at least 6, at least 7, at least 8, at least 9, at least 10, at least 
15, at least 20, at least 25, at least 30, at least 35, at least 40, at least 50, at least 
60, at least 70, at least 80, at least 90, at least 100, at least 110, at least 120, at 

30 least 130, at least 140, at least 150, at least 160, at least 165, at least 170, at least 
171, at least 172, at least 173, or at least 174 contiguous amino acids having the 
above percentages of amino acid identity compared to the corresponding amino 
acids in SEQ ID NO:3 and SEQ ID NO:4. 
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In accordance with the invention described herein, the above 
protein or protein fragment comprises an amino acid or an amino acid sequence 
which corresponds to a mutation in the mouse Agr2 protein according to SEQ ID 
NO:3 which, if encoded by the mouse Agr2 gene and present in the genome of all 
or essentially all cells of a mouse in a homozygous manner, results in a phenotype 
associated with an alteration in goblet cell function compared to the corresponding 
wild-type animal. 

In an alternative embodiment, the protein or protein fragment 
comprises an amino acid or an amino acid sequence which corresponds to a 
mutation in the mouse Agr2 protein or the human AGR2 protein according to 
SEQ ID NO:3 and SEQ ID NO:4, respectively, which leads to an altered 
biological activity of the mutated protein when compared to the corresponding 
wild-type mouse Agr2 protein or human AGR2 protein in an in vitro assay. In 
vitro assays contemplated in this regard are, for example, those already explained 
in detail in connection with the non-human vertebrate animal above. 

In yet a further alternative embodiment, the protein or protein 
fragment comprises an amino acid or an amino acid sequence which corresponds 
to a mutation of the human AGR2 protein according to SEQ ID NO:4 which is 
indicative of an increased risk of a human subject of developing a medical 
condition associated with an alteration in goblet cell function, or indicative of an 
association of a medical condition in a human subject which is associated with an 
alteration in goblet cell function with altered AGR2 expression or function. The 
term "corresponds to" as used in the present and the preceding paragraphs refers 
to the fact that the allele reflects the mutation in the way explained previously in 
the present specification. Also, a mutation of the human AGR2 protein according 
to SEQ ID NO:4 referred to in the present paragraph is again of the kind described 
in more detail elsewhere herein, and identifiable by the methods described and 
claimed in the present specification. 

In a preferred embodiment, the protein of the invention represents 
an orthologue of the mouse Agr2 or the human AGR2 protein according to SEQ 
ID NO:3 and SEQ ID NO:4, preferably a vertebrate orthologue, in particular an 
orthologue wherein said vertebrate is an amphibian vertebrate, in particular 
Xenopus leavis. Alternatively, it may represent a mammalian orthologue, in 
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particular a rat, rabbit, hamster, dog, cat, sheep, or horse orthologue. It may also 
be a variant of the mouse Agr2 protein or the human AGR2 protein according to 
SEQ ID NO:3 and SEQ ID NO:4, respectively, or of said orthologue, allelic or 
otherwise, wherein certain amino acids or partial amino acid sequences have been 
replaced, added, or deleted 

Again in a preferred embodiment, the mutation mentioned above 
results in a deletion or substitution by another amino acid of an amino acid of said 
mouse Agr2 protein or human AGR2 protein according to SEQ ID NO:3 and SEQ 
ID NO:4, respectively. Alternatively, the mutation may result in an insertion of 
additional amino acids not normally present in the amino acid sequence of the 
mouse Agr2 protein or the human AGR2 protein defined above. 

The deletion, substitution, or insertion may furthermore occur in an 
evolutionary conserved region of said mouse Agr2 protein or said human AGR2 
protein. In particular, it may be a substitution of an amino acid which is identical 
or similar between mouse, rat, and human AGR2, preferably between mouse, rat, 
human, and Xenopus laevis AGR2, more preferably between mouse, rat, human, 
Xenopus laevis, and Caenorhabditis elegans AGR2, by another amino acid. Such 
amino acid may be a non-naturally occurring or a naturally ocurring amino acid. 
The skilled artisan will be readily able to determine regions which are generally 
evolutionary conserved amongst different species on the basis of sequence 
comparisons such as that shown in Figure 2. The amino acids identical or similar 
between the species specifically mentioned above will furthermore be readily 
identifiable by the skilled artisan on the basis of the amino acid sequence 
comparisons depicted in Figures 16, 17, and 18 and the accompanying Tables 
(Tables 1, 2, and 3, respectively). 

Preferably, the wild type residue of the modified AGR2 protein is 
replaced by an amino acid with different size and/or polarity, i.e., a non- 
conservative amino acid substitution, as defined below. 

Also preferred is an AGR2 mutein wherein residue 137 of AGR2 
according to SEQ ID NO:4 is replaced by an amino acid other than a large 
aliphatic, nonpolar amino acid, and preferably is replaced by an acidic amino acid 
and most preferably by a glutamic acid. 
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In one preferred embodiment a murine Agr2 mutein of the present 
invention has the amino acid sequence shown in SEQ ID NO:2. 

In a further preferred embodiment a human AGR2 mutein of the 
present invention has the amino acid sequence shown in SEQ ID NO:30. 
5 An "isolated" or "purified" polypeptide or protein, or a biologically 

active fragment thereof as described and claimed herein is substantially free of 
cellular material or other contaminating proteins from the cell or tissue source 
from which the polypeptide or protein is derived, or substantially free from 
chemical precursors or other chemicals when chemically synthesized. The 
10 language "substantially free of cellular material" includes preparations of AGR2 
protein in which the protein is separated from cellular components of the cells 
from which the protein is isolated or in which it is recombinantly produced. 

The invention furthermore encompasses mature mouse Agr2 or 
human AGR2 proteins, or their vertebrate orthologues, e.g., the specific 
15 orthologues referred to above, which comprise an amino acid or amino acid 
sequences corresponding to a mutation as defined above. As used herein, a 
"mature" form of a polypeptide or protein may arise from a post-translational 
modification. Such additional processes include, by way of non-limiting example, 
proteolytic cleavage, e.g., cleavage of a leader sequence, glycosylation, 
20 myristoylation or phosphorylation. In general, a mature polypeptide or protein 
according to the present invention may result from the operation of one of these 
processes, or a combination of any of them. 

As mentioned above, when for example residue 137 of SEQ ID 
NO:3 is replaced by an amino acid with different size and/or polarity (excluding 
the wild type residue at this position), this is termed a non-conservative amino 
acid substitution. Non-conservative substitutions are defined as exchanges of an 
amino acid by another amino acid listed in a different group of the five standard 
amino acid groups shown below: 

1. small aliphatic, nonpolar or slightly polar residues: Ala, Ser, 
nir 9 (Pro),(Gly); 

2. negatively charged residues and their amides: Asn, Asp, Glu, 
Gin; 

3. positively charged residues: His, Arg, Lys; 
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4. large aliphatic, nonpolar residues: Met, Leu, He, Val, (Cys); 

5 . large aromatic residues : Phe, Tyr, Tip. 

Conservative substitutions are defined as exchanges of an amino 
5 acid by another amino acid listed within the same group of the five standard 
amino acid groups shown above. Three residues are parenthesized because of their 
special role in protein architecture. Gly is the only residue without a side-chain 
and therefore imparts flexibility to the chain. Pro has an unusual geometry which 
tightly constrains the chain. Cys can participate in disulfide bonds. 
10 The invention also provides novel chimeric or fusion proteins. As 

used herein, a novel "chimeric protein" or "fusion protein" comprises a novel 
AGR2 polypeptide linked to a non-AGR2 polypeptide (i.e., a polypeptide that 
does not comprise AGR2 or a fragment thereof). 

In one embodiment, the fusion protein is a GST-AGR2 heavy chain 
15 fusion protein in which the AGR2 sequences are fused to the C-terminus of the 
GST (glutathione-S-transferase) sequences. Such fusion proteins can facilitate the 
purification of recombinant AGR? polypeptides. 

In yet another embodiment, the fusion protein is a AGR2- 
immunoglobulin fusion protein in which the AGR2 sequences are fused to 
20 sequences derived from a member of the immunoglobulin protein family, 
especially Fc region polypeptides. Also contemplated are fusions of AGR2 
sequences (mutant proteins or fragments) fused to amino acid sequences that are 
commonly used to facilitate purification or labeling, e.g., polyhistidine tails (such 
as hexahistidine segments), FLAG tags, and streptavidin. 

The amino acid sequences of the present invention may be made by 
using peptide synthesis techniques well known in the art, such as solid phase 
peptide synthesis (see, for example, Fields et al. f "Principles and Practice of Solid 
Phase Synthesis" in Synthetic Peptides, A Users Guide, Grant, G.A., Ed., 
W.H. Freeman Co. NY. 1992, Chap. 3 pp. 77-183; Barlos, K. and Gatos, D. 
"Convergent Peptide Synthesis" in Fmoc Solid Phase Peptide Synthesis, Chan, 
W.C. and White, P.D. Eds., Oxford University Press, New York, 2000, Chap. 9: 
pp. 215-228) or by recombinant DNA manipulations and recombinant expression. 
Techniques for making substitution mutations at predetermined sites in DNA 

50 



WO 2004/056858 



PCT/EP2003/014834 



having known sequence are well known and include, for example, Ml 3 
mutagenesis. Manipulation of DNA sequences to produce variant proteins which 
manifests as substitutional, insertional or deletional variants are conveniently 
described, for example, in Sambrook et al. (1989), supra. 

5 

Antibodies 

A further aspect of the present invention are antibodies specifically 
recognizing an epitope in a mutein as described further below, wherein said 
epitope comprises the amino acid or the amino acid sequence in said protein 
which corresponds to the mutation described in connection with these muteins. 

Also included in the invention are antibodies to fragments of 
mutein AGR2 polypeptides (including amino terminal fragments), as well as 
antibodies to fusion proteins containing AGR2 mutein polypeptides or fragments 
of AGR2 mutein polypeptides. The term "antibody" as used herein refers to 
immunoglobulin molecules and immunologically active portions of 
immunoglobulin (Ig) molecules, i.e., molecules that contain an antigen binding 
site that specifically binds (immunoreacts with) an antigen. Such antibodies 
include, e.g., polyclonal, monoclonal, chimeric, single chain, Fab, Fab* and 
fragments, and a Fab expression library. In general, an antibody molecule obtained 
from humans relates to any of the classes IgG, IgM, IgA, IgE and IgD, which 
differ from one another by the nature of the heavy chain present in the molecule. 
Certain classes have subclasses as well, such as IgGi, IgG2, and others. 
Furthermore, in humans, the light chain may be a kappa chain or a lambda chain. 
Reference herein to antibodies includes a reference to all such classes, subclasses 
and types of human antibody species. 

An AGR2 polypeptide, i.e., wild type or mutant AGR2, as 
described herein, may be intended to serve as an antigen, or a portion or fragment 
thereof, and additionally can be used as an immunogen to generate antibodies that 
immunospecifically bind the antigen, using standard techniques for polyclonal and 
monoclonal antibody preparation. Antigenic peptide fragments of the antigen for 
use as immunogens includes, e.g., at least 7 amino acid residues of the amino acid 
sequence of the mutated region such as an amino acid sequence shown in SEQ ID 
NO:2, and in SEQ ID NO:30 or in SEQ ID NO:3 and SEQ ID NO:4, respectively, 
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and encompasses an epitope thereof such that an antibody raised against the 
peptide forms a specific immune complex with the full length protein or with any 
fragment that contains the epitope. Preferably, the antigenic peptide comprises at 
least 10 amino acid residues, or at least 15 amino acid residues, or at least 20 
5 amino acid residues, or at least 30 amino acid residues. Preferred epitopes 
encompassed by the antigenic peptide are regions of the protein that are located on 
its surface; commonly these are hydrophilic regions. 

In certain embodiments of the invention, at least one epitope 
encompassed by the antigenic peptide is a region of mutein or wild type AGR2 

10 polypeptide that is located on the surface of the protein, e.g., a hydrophilic region. 
A hydrophobicity analysis of a mutein or wild type AGR2 polypeptide will 
indicate which regions of a mutein or wild type AGR2 protein are particularly 
hydrophilic and, therefore, are likely to encode surface residues useful for 
targeting antibody production. As a means for targeting antibody production, 

15 hydropathy plots showing regions of hydrophilicity and hydrophobicity may be 
generated by any method well known in the art, including,, for example, the Kyte 
Doolittle or the Hopp Woods methods, either with or without Fourier 
transformation. See, e.g., (Hopp and Woods, 1981; Kyte and Doolittle, 1982b; 
Kyte and Doolittle, 1982a). Antibodies that are specific for one or more domains 

20 within an antigenic protein, or derivatives, fragments, analogs or homologs 
thereof, are also provided herein. 

A protein of the invention, or a derivative, fragment, analog, 
homolog or ortholog thereof, may be utilized as an immunogen in the generation 
of antibodies that immunospecifically bind these protein components. 

25 Various procedures known within the art may be used for the 

production of polyclonal or monoclonal antibodies directed against a protein of 
the invention, or against derivatives, fragments, analogs, homologues or 
orthologues thereof. See, for example, Antibodies: A Laboratory Manual, 
Harlow and Lane (1988) Cold Spring Harbor Laboratory Press, Cold Spring 

30 Harbor, NY. Some of these antibodies are discussed below. 
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Polyclonal Antibodies 

For the production of polyclonal antibodies, various suitable host 
animals (e.g., rabbit, goat, mouse or other mammal) may be immunized by one or 
more injections with the protein of the invention, a synthetic variant thereof, or a 

5 derivative of the foregoing. An appropriate immunogenic preparation can contain, 
for example, the naturally occurring immunogenic protein, a chemically 
synthesized polypeptide representing the immunogenic protein, or a 
recombinantly expressed immunogenic protein. Furthermore, the protein may be 
conjugated to a second protein known to be immunogenic in the mammal being 

10 immunized. Examples of such immunogenic proteins include but are not limited 
to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean 
trypsin inhibitor. 

The preparation can further include an adjuvant. Various adjuvants 
used to increase the immunological response include, but are not limited to, 

15 Freund's (complete and incomplete), mineral gels (e.g., aluminum hydroxide), 
surface active substances (e.g., lysolecithin, pluronic polyols, polyanions, 
peptides, oil emulsions, dinitrophenol, etc.), adjuvants usable in humans such as 
Bacille Calmette-Guerin and Corynebacterium parvum, or similar 
immunostimulatory agents. Additional examples of adjuvants which can be 

20 employed include MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic 
trehalose dicorynomycolate). 

The polyclonal antibody molecules directed against the 
immunogenic protein can be isolated from the mammal (e.g., from the blood) and 
further purified by well known techniques, such as affinity chromatography using 

25 protein A or protein G, which provide primarily the IgG fraction of immune 
serum. Subsequently, or alternatively, the specific antigen which is the target of 
the immunoglobulin sought, or an epitope thereof, may be immobilized on a 
column to purify the immune specific antibody by immunoafSnity 
chromatography. 

30 Monoclonal Antibodies 

The term "monoclonal antibody" (MAb) or "monoclonal antibody 
composition", as used herein, refers to a population of antibody molecules that 
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contain only one molecular species of antibody molecule consisting of a unique 
light chain gene product and a unique heavy chain gene product. In particular, the 
complementarity determining regions (CDRs) of the monoclonal antibody are 
identical in all the molecules of the population. MAbs thus contain an antigen 
5 binding site capable of immunoreacting with a particular epitope of the antigen 
characterized by a unique binding affinity for it. 

Monoclonal antibodies can be prepared using hybridoma methods, 
such as those described by Kohler and Milstein (Kohler and Milstein, 1975). In a 
hybridoma method, a mouse, hamster, or other appropriate host animal, is 

10 typically immunized with an immunizing agent to elicit lymphocytes that produce 
or are capable of producing antibodies that will specifically bind to the 
immunizing agent. Alternatively, the lymphocytes can be immunized in vitro. 

The immunizing agent will typically include the protein antigen, a 
fragment thereof or a fusion protein thereof. Generally, either peripheral blood 

15 lymphocytes are used if cells of human origin are desired, or spleen cells or lymph 
node cells are used if non-human mammalian sources are desired. The 
lymphocytes are then fused with an immortalized cell line using a suitable fusing 
agent, such as polyethylene glycol, to form a hybridoma cell. Goding, 
Monoclonal Antibodies: Principles and Practice, Academic Press, (1986) 

20 pp. 59-103. Immortalized cell lines are usually transformed mammalian cells, 
particularly myeloma cells of rodent, bovine and human origin. Usually, rat or 
mouse myeloma cell lines are employed. The hybridoma cells can be cultured in a 
suitable culture medium that preferably contains one or more substances that 
inhibit the growth or survival of the unfused, immortalized cells. For example, if 

25 the parental cells lack the enzyme hypoxanthine guanine phosphoribosyl 
transferase (HGPRT or HPRT), the culture medium for the hybridomas typically 
will include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which 
substances prevent the growth of HGPRT-deficient cells. 

Preferred immortalized cell lines are those that fuse efficiently, 

30 support stable high level expression of antibody by the selected antibody- 
producing cells, and are sensitive to a medium such as HAT medium. More 
preferred immortalized cell lines are murine myeloma lines, which can be 
obtained, for instance, from the Salk Institute Cell Distribution Center, San Diego, 
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California and the American Type Culture Collection, Manassas, Virginia. Human 
myeloma and mouse-human heteromyeloma cell lines also have been described 
for the production of human monoclonal antibodies ((Kozbor et al., 1984), 
Brodeur et al, Monoclonal Antibody Production Techniques and 

5 Applications, Marcel Dekker, Inc., New York, (1987) pp. 51-63). 

The culture medium in which the hybridoma cells are cultured can 
then be assayed for the presence of monoclonal antibodies directed against the 
antigen. Preferably, the binding specificity of monoclonal antibodies produced by 
the hybridoma cells is determined by immunoprecipitation or by an in vitro 

10 binding assay, such as radioimmunoassay (RIA) or enzyme-linked 
immunoabsorbent assay (ELISA). Such techniques and assays are known in the 
art. The binding affinity of the monoclonal antibody can, for example, be 
determined by the Scatchard analysis of Munson and Rodbard (Munson and 
Rodbard, 1980). Preferably, antibodies having a high degree of specificity and a 

15 high binding affinity for the target antigen are isolated. 

After the desired hybridoma cells are identified, the clones can be 
subcloned by limiting dilution procedures and grown by standard methods. 
Suitable culture media for this purpose include, for example, Dulbecco's Modified 
Eagle's Medium and RPMI-1640 medium. Alternatively, the hybridoma cells can 

20 be grown in vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones can be 
isolated or purified from the culture medium or ascites fluid by conventional 
immunoglobulin purification procedures such as, for example, protein A- 
Sepharose, hydroxylapatite chromatography, gel electrophoresis, dialysis, or 

25 affinity chromatography. 

The monoclonal antibodies can also be made by recombinant DNA 
methods, such as those described in US Patent No. 4,816,567. DNA encoding the 
monoclonal antibodies of the invention can be readily isolated and sequenced 
using conventional procedures (e.g., by using oligonucleotide probes that are 

30 capable of binding specifically to genes encoding the heavy and light chains of 
murine antibodies). The hybridoma cells of the invention serve as a preferred 
source of such DNA. Once isolated, the DNA can be placed into expression 
vectors, which are then transfected into host cells such as simian COS cells, 
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Chinese hamster ovary (CHO) cells, or myeloma cells that do not otherwise 
produce immunoglobulin protein, to obtain the synthesis of monoclonal antibodies 
in the recombinant host cells. The DNA also can be modified, for example, by 
substituting the coding sequence for human heavy and light chain constant 

5 domains in place of the homologous murine sequences (US Patent No. 4,816,567; 
Morrison, 1994b) or by covalently joining to the immunoglobulin coding 
sequence all or part of the coding sequence for a non-immunoglobulin 
polypeptide. Such a non-immunoglobulin polypeptide can be substituted for the 
constant domains of an antibody of the invention, or can be substituted for the 

10 variable domains of one antigen-combining site of an antibody of the invention to 
create a chimeric bivalent antibody. 

Humanized Antibodies 

The antibodies directed against the protein antigens of the 
invention can further comprise humanized antibodies or human antibodies. These 

15 antibodies are suitable for administration to humans without engendering an 
immune response by the human against the administered immunoglobulin. 
Humanized forms of antibodies are chimeric immunoglobulins, immunoglobulin 
chains or fragments thereof (such as Fv, Fab, Fab', F(ab') 2 or other antigen-binding 
subsequences of antibodies) that are principally comprised of the sequence of a 

20 human immunoglobulin, and contain minimal sequence derived from a non- 
human immunoglobulin. Humanization can be performed following the method of 
Winter and co-workers (Jones et al., 1986; Riechmann et al., 1988b; Verhoeyen et 
al., 1988a; Riechmann et al., 1988a; Verhoeyen et al., 1988b), by substituting 
rodent CDRs or CDR sequences for the corresponding sequences of a human 

25 antibody. (See also US Patent No. 5,225,539.) In some instances, Fv framework 
residues of the human immunoglobulin are replaced by corresponding non-human 
residues. Humanized antibodies can also comprise residues, which are found 
neither in the recipient antibody nor in the imported CDR or framework 
sequences. In general, the humanized antibody will comprise substantially all of at 

30 least one, and typically two, variable domains, in which all or substantially all of 
the CDR regions correspond to those of a non-human immunoglobulin and all or 
substantially all of the framework regions are those of a human immunoglobulin 
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consensus sequence. The humanized antibody optimally also will comprise at 
least a portion of an immunoglobulin constant region (Fc), typically that of a 
human immunoglobulin (Jones et al., 1986; Riechmann et al., 1988b; Riechmann 
etal., 1988a). 

5 Human Antibodies 

Fully human antibodies relate to antibody molecules in which 
essentially the entire sequences of both the light chain and the heavy chain, 
including the CDRs, arise from human genes. Such antibodies are termed "human 
antibodies", or "fully human antibodies" herein. Human monoclonal antibodies 

10 can be prepared by the trioma technique; the human B-ceil hybridoma technique 
and the EBV hybridoma technique to produce human monoclonal antibodies (see 
Cole, et al., 1985 In: Monoclonal Antibodies and Cancer Therapy, Alan R. 
Liss, Inc., pp. 77-96). Human monoclonal antibodies may be utilized in the 
practice of the present invention and may be produced by using human 

15 hybridomas (Cote et al., 1983) or by transforming human B-cells with Epstein 
Barr Virus in vitro (see Cole, et al. (1985) In: Monoclonal Antibodies and 
Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

In addition, human antibodies can also be produced using 
additional techniques, including phage display libraries (Hoogenboom and Winter, 

20 1992; Marks et al., 1991a; Marks et al., 1991b). Similarly, human antibodies can 
be made by introducing human immunoglobulin loci into transgenic animals, e.g., 
mice in which the endogenous immunoglobulin genes have been partially or 
completely inactivated. Upon challenge, human antibody production is observed, 
which closely resembles that seen in humans in all respects, including gene 

25 rearrangement, assembly, and antibody repertoire. This approach is described, for 
example, in US Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 
5,633,425; 5,661,016, and in here: Fishwild et al., 1996b; Lonberg et al., 1994b; 
Lonberg and Huszar, 1995b; Marks et al., 1992; Morrison, 1994b; Neuberger, 
1996b; Fishwild et al., 1996a; Lonberg et al., 1994a; Lonberg and Huszar, 1995a; 

30 Morrison, 1994a; Neuberger, 1996a. 

Human antibodies may additionally be produced using transgenic 
non-human animals which are modified so as to produce fully human antibodies 



57 



WO 2004/056858 



PCT/EP2003/014834 



rather than the animal's endogenous antibodies in response to challenge by an 
antigen. See PCT publication WO94/02602. The endogenous genes encoding the 
heavy and light immunoglobulin chains in the nonhuman host have been 
incapacitated, and active loci encoding human heavy and light chain 
5 immunoglobulins are inserted into the host's genome. The human genes are 
incorporated, for example, using yeast artificial chromosomes containing the 
requisite human DNA segments. An animal which provides all the desired 
modifications is then obtained as progeny by crossbreeding intermediate 
transgenic animals containing fewer than the full complement of the 
10 modifications. The preferred embodiment of such a nonhuman animal is a mouse, 
and is termed the Xenomouse™ as disclosed in PCT publications WO 96/33735 
and WO 96/34096. This animal produces B cells which secrete fully human 
immunoglobulins. The antibodies can be obtained directly from the animal after 
immunization with an immunogen of interest, as, for example, a preparation of a 
15 polyclonal antibody, or alternatively from immortalized B cells derived from the 
animal, such as hybridomas producing monoclonal antibodies. Additionally, the 
genes encoding the immunoglobulins with human variable regions can be 
recovered and expressed to obtain the antibodies directly, or can be further 
modified to obtain analogs of antibodies such as, for example, single chain Fv 
molecules. 

An example of a method of producing a nonhuman host, 
exemplified as a mouse, lacking expression of an endogenous immunoglobulin 
heavy chain is disclosed in US Patent No. 5,939,598. It can be obtained by a 
method including deleting the J segment genes from at least one endogenous 
heavy chain locus in an embryonic stem cell to prevent rearrangement of the locus 
and to prevent formation of a transcript of a rearranged immunoglobulin heavy 
chain locus, the deletion being effected by a targeting vector containing a gene 
encoding a selectable marker; and producing from the embryonic stem cell a 
transgenic mouse whose somatic and germ cells contain the gene encoding the 
selectable marker. 

A method for producing an antibody of interest, such as a human 
antibody, is disclosed in US Patent No. 5,916,771. It includes introducing an 
expression vector that contains a nucleotide sequence encoding a heavy chain into 
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one mammalian host cell in culture, introducing an expression vector containing a 
nucleotide sequence encoding a light chain into another mammalian host cell, and 
fusing the two cells to form a hybrid cell. The hybrid cell expresses an antibody 
containing the heavy chain and the light chain. 
5 In a further improvement on this procedure, a method for 

identifying a clinically relevant epitope on an immunogen, and a correlative 
method for selecting an antibody that binds immunospecifically to the relevant 
epitope with high affinity, are disclosed in PCT publication WO 99/53049. 

F ah Fragments and Single Chain Antibodies 

10 According to the invention, techniques can be adapted for the 

production of single-chain antibodies specific to an antigenic protein of the 
invention (see e.g., US Patent No. 4,946,778). In addition, methods can be adapted 
for the construction of Fab expression libraries (Huse et al., 1989) to allow rapid 
and effective identification of monoclonal Fab fragments with the desired 

15 specificity for a protein or derivatives, fragments, analogs or homologs thereof. 
Antibody fragments that contain the idiotypes to a protein antigen may be 
produced by techniques known in the art including, but not limited to: (i) an F m2 
fragment produced by pepsin digestion of an antibody molecule; (ii) an F a b 
fragment generated by reducing the disulfide bridges of an F^ fragment; (iii) an 

20 Fab fragment generated by the treatment of the antibody molecule with papain and 
a reducing agent and (iv) F v fragments. 

Bispecific Antibodies 

Bispecific antibodies are monoclonal, preferably human or 
humanized, antibodies that have binding specificities for at least two different 
25 antigens. In the present case, one of the binding specificities is for an antigenic 
protein of the invention. The second binding target is any other antigen, and 
advantageously is a cell-surface protein or receptor or receptor subunit. 

Methods for making bispecific antibodies are known in the art. 
Traditionally, the recombinant production of bispecific antibodies is based on the 
30 co-expression of two immunoglobulin heavy-chain/light-chain pairs, where the 
two heavy chains have different specificities (Milstein and Cuello, 1983). Because 
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of the random assortment of immunoglobulin heavy and light chains, these 
hybridomas (quadromas) produce a potential mixture of ten different antibody 
molecules, of which only one has the correct bispecific structure. The purification 
of the correct molecule is usually accomplished by affinity chromatography steps. 
5 Similar procedures are disclosed in WO 93/08829 and in Traunecker et ah 
(Traunecker et al., 1991). 

Antibody variable domains with the desired binding specificities 
(antibody-antigen combining sites) can be fused to immunoglobulin constant 
domain sequences. The fusion preferably is with an immunoglobulin heavy-chain 
10 constant domain, comprising at least part of the hinge, CH2, and CH3 regions. It 
is preferred to have the first heavy-chain constant region (CHI) containing the site 
necessary for light-chain binding present in at least one of the fusions. DNAs 
encoding the immunoglobulin heavy-chain fusions and, if desired, the 
immunoglobulin light chain, are inserted into separate expression vectors, and are 
15 co-transfected into a suitable host organism. For further details of generating 
bispecific antibodies see, for example, Suresh et ah (Suresh et al., 1986). 

According to another approach described in WO 96/27011, the 
interface between a pair of antibody molecules can be engineered to maximize the 
percentage of heterodimers which are recovered from recombinant cell culture. 
The preferred interface comprises at least a part of the CH3 region of an antibody 
constant domain. In this method, one or more small amino acid side chains from 
the interface of the first antibody molecule are replaced with larger side chains 
(e.g. tyrosine or tryptophan). Compensatory "cavities" of identical or similar size 
to the large side chain(s) are created on the interface of the second antibody 
molecule by replacing large amino acid side chains with smaller ones (e.g. alanine 
or threonine). This provides a mechanism for increasing the yield of the 
heterodimer over other unwanted end-products such as homodimers. 

Bispecific antibodies can be prepared as full length antibodies or 
antibody fragments (e.g. F(ab*) 2 bispecific antibodies). Techniques for generating 
bispecific antibodies from antibody fragments have been described in the 
literature. For example, bispecific antibodies can be prepared using chemical 
linkage. Brennan et ah (Brennan et al., 1985) describe a procedure wherein intact 
antibodies are proteolytically cleaved to generate F(ab')2 fragments. These 
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fragments are reduced in the presence of the dithiol complexing agent sodium 
arsenite to stabilize vicinal dithiols and prevent intermolecular disulfide 
formation. The Fab' fragments generated are then converted to thionitrobenzoate 
(TNB) derivatives. One of the Fab'-TNB derivatives is then reconverted to the 
5 Fab' -thiol by reduction with mercaptoethylamine and is mixed with an equimolar 
amount of the other Fab'-TNB derivative to form the bispecific antibody. The 
bispecific antibodies produced can be used as agents for the selective 
immobilization of enzymes. 

Additionally, Fab' fragments can be directly recovered from E. coli 
10 and chemically coupled to form bispecific antibodies. Shalaby et al (Shalaby et 
al., 1992) describe the production of a fully humanized bispecific antibody F(ab') 2 
molecule. Each Fab' fragment was separately secreted from E. coli and subjected 
to directed chemical coupling in vitro to form the bispecific antibody. The 
bispecific antibody thus formed was able to bind to cells overexpressing the 
15 ErbB2 receptor and normal human T cells, as well as trigger the lytic activity of 
human cytotoxic lymphocytes against human breast tumor targets. 

Various techniques for making and isolating bispecific antibody 
fragments directly from recombinant cell culture have also been described. For 
example, bispecific antibodies have been produced using leucine zippers 
(Kostelny et al., 1992). The leucine zipper peptides from the Fos and Jun proteins 
were linked to the Fab' portions of two different antibodies by gene fusion. The 
antibody homodimers were reduced at the hinge region to form monomers and 
then re-oxidized to form the antibody heterodimers. This method can also be 
utilized for the production of antibody homodimers. The "diabody" technology 
(Holliger et al., 1993) has provided an alternative mechanism for making 
bispecific antibody fragments. The fragments comprise a heavy-chain variable 
domain (V H ) connected to a light-chain variable domain (Vl) by a linker which is 
too short to allow pairing between the two domains on the same chain. 
Accordingly, the Vh and V L domains of one fragment are forced to pair with the 
complementary V L and V H domains of another fragment, thereby forming two 
antigen-binding sites. Another strategy for making bispecific antibody fragments 
by the use of single-chain Fv (sFv) dimers has also been reported (Gruber et al., 
1994). 
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Antibodies with more than two valencies are contemplated. For 
example, trispecific antibodies can be prepared (Tutt et al., 1991). 

Exemplary bispecific antibodies can bind to two different epitopes, 
at least one of which originates in the protein antigen of the invention. Bispecific 
5 antibodies can also be used to direct various agents to cells, which express a 
particular antigen. These antibodies possess an antigen-binding arm and an arm, 
which binds an agent such as a radionuclide chelator (e.g., EOTUBE, DPTA, 
DOTA, or TETA). 

Heteroconiueate Antibodies 

Heteroconjugate antibodies are also within the scope of the present 
invention. Heteroconjugate antibodies are composed of two covalently joined 
antibodies. Such antibodies have, for example, been proposed to target immune 
system cells to unwanted cells (U.S. Patent No. 4,676,980), and for treatment of 
HIV infection (WO 91/00360; WO 92/200373; EP 03089). It is contemplated that 
the antibodies can be prepared in vitro using known methods in synthetic protein 
chemistry, including those involving cross-linking agents. For example, 
immunotoxins can be constructed using a disulfide exchange reaction or by 
forming a thioether bond. Examples of suitable reagents for this purpose include 
iminothiolate and methyl-4-mercaptobutyrimidate and those disclosed, for 
example, in US Patent No. 4,676,980. 

Effector Function Engineering 

It can be desirable to modify the antibody of the invention with 
respect to effector function, so as to enhance, e.g., the effectiveness of the 
antibody. For example, cysteine residue(s) can be introduced into the Fc region, 
thereby allowing interchain disulfide bond formation in this region. The 
homodimeric antibody thus generated can have improved internalization 
capability and/or increased complement-mediated cell killing and antibody- 
dependent cellular cytotoxicity (ADCC) (Caron et al., 1992; Shopes, 1992a; 
Shopes, 1992b). Homodimeric antibodies with enhanced anti-tumor activity can 
also be prepared using heterobifunctional cross-linkers as described in Wolff et al 
(Wolff et al., 1993). Alternatively, an antibody can be engineered that has dual Fc 
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regions and can thereby have enhanced complement lysis and ADCC capabilities 
(Stevenson et al, 1989). 

Immunoconiugates 

The invention also pertains to immunoconjugates comprising an 
5 antibody conjugated to a cytotoxic agent such as a chemotherapeutic agent, toxin 
(e.g., an enzymatically active toxin of bacterial, fungal, plant, or animal origin, or 
fragments thereof), or a radioactive isotope (i.e., a radioconjugate). 

Enzymatically active toxins and fragments thereof that can be used 
include diphtheria A chain, nonbinding active fragments of diphtheria toxin, 
10 exotoxin A chain (from Pseudomonas aeruginosa), ricin A chain, abrin A chain, 
modeccin A chain, alpha-sarcin, Aleurites fordii proteins, dianthin proteins, 
Phytolaca americana proteins (PAPI, PAPII, and PAP-S), momordica charantia 
inhibitor, curcin, crotin, sapaonaria officinalis inhibitor, gelonin, mitogellin, 
restrictocin, phenomycin, enomycin, and the tricothecenes. A variety of 
15 radionuclides are available for the production of radioconjugated antibodies. 
Examples include 212 Bi, 131 1, 13, In, *°Y, and 186 Re. 

Conjugates of the antibody and cytotoxic agent are made using a 
variety of bifunctional protein-coupling agents such as N-succinimidyl-3-(2- 
pyridyldithiol) propionate (SPDP), iminothiolane (IT), bifunctional derivatives of 
20 imidoesters (such as dimethyl adipimidate HCL), active esters (such as 
disuccinimidyl suberate), aldehydes (such as glutareldehyde), bis-azido 
compounds (such as bis (p-azidobenzoyl) hexanediamine), bis-diazonium 
derivatives (such as bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates 
(such as tolyene 2,6-diisocyanate), and bis-active fluorine compounds (such as 
25 l,5-difluoro-2,4-dinitrobenzene). For example, a ricin immunotoxin can be 
prepared as described (Vitetta et al., 1983). Carbon-14-labeled 1- 
isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid (MX-DTPA) is 
an exemplary chelating agent for conjugation of radionucleotide to the antibody. 
See WO94/11026. 

30 In another embodiment, the antibody can be conjugated to a 

"receptor" (such streptavidin) for utilization in tumor pretargeting wherein the 
antibody-receptor conjugate is administered to the patient, followed by removal of 
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unbound conjugate from the circulation using a clearing agent and then 
administration of a "ligand" (e.g., avidin) that is in turn conjugated to a cytotoxic 
agent. 

Immonoconjugates according to the present invention are 
5 furthermore those comprising an antibody as described above conjugated to an 
imaging agent. Imaging agents suitable in this regard are, for example, again 
certain radioactive isotopes. Suitable in this regard are 18 F, ^Cu, 67 Ga, 68 Ga, 
99m Tc, ni In, 123 I, 125 I, ,31 I, ,69 Yb, 186 Re, and 201 T1. Particularly preferred in this 
regard is 99m Tc. The radioactive isotopes will suitably be conjugated to the 
10 antibody via a chelating group that is covalently attached to the antibody and is 
capable of chelating the radioactive isotope. 

Anticalins 

Anticalins are engineered proteins with antibody-like binding 
15 functions derived from natural lipocalins as a scaffold. These small monomeric 
proteins of only about 150 to 190 amino acids may have certain competitive 
advantages over antibodies, e.g., an increased binding specificity and improved 
tissue penetration, for example in the case of solid tumors. The anticalins of the 
present invention preferably bind their ligands with high specificity and affinity in 
20 the nanomolar range, e.g., in the low nanomolar range with K(D) values ranging 
between 12 nM and 35 nM. The set of four loops of anticalins may be easily 
manipulated at the genetic level (Weiss and Lowmann, 2000; Skerra, 2001). A 
preferred anticalin according to the present invention specifically binds to the 
AGR2 muteins as described herein. Another preferred anticalin specifically binds 
25 to the wild type AGR2 protein, e.g., the AGR2 proteins according to SEQ ID 
NO:3orSEQIDNO:4. 

Methods for producing aptamers specific for proteins and nucleic 
acids are known. See, e.g., US Patent 5,840,867, US Patent 5,756,291, and US 
Patent 5,582,981. 

30 

Vectors and Cells Expressing AGR2 Protein 
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Another aspect of the invention pertains to vectors, preferably 
expression vectors, containing a nucleic acid encoding a AGR2 mutein, or 
derivatives, fragments, analogs or homologs thereof. As used herein, the term 
"vector" refers to a nucleic acid molecule capable of transporting another nucleic 
5 acid to which it has been linked One type of vector is a "plasmid", which refers to 
a circular double stranded circular DNA molecule into which additional DNA 
segments can be ligated. Another type of vector is a viral vector, wherein 
additional DNA segments can be ligated into the viral genome. Certain vectors are 
capable of autonomous replication in a host cell into which they are introduced 
10 (e.g., bacterial vectors having a bacterial origin of replication and episomal 
mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are 
integrated into the genome of a host cell upon introduction into the host cell, and 
thereby are replicated along with the host genome. Moreover, certain vectors are 
capable of directing the expression of genes to which they are operatively linked. 
15 Such vectors are referred to herein as "expression vectors". 

A host cell of the invention, such as a prokaryotic or eukaryotic 
host cell in culture, can be used to produce (i.e., express) AGR2 mutein. 
Accordingly, the invention further provides methods for producing AGR2 mutein 
using the host cells of the invention. In one embodiment, the method comprises 
20 culturing the host cell of invention (into which a recombinant expression vector 
encoding AGR2 mutein protein has been introduced) in a suitable medium such 
that AGR2 mutein is produced. In another embodiment, the method further 
comprises isolating AGR2 mutein from the medium or the host cell. 

The host cells of the invention can also be used to produce non- 
25 human transgenic animals. For example, in one embodiment, a host cell of the 
invention is a fertilized oocyte or an embryonic stem cell into which AGR2 
protein-coding sequences have been introduced. Such host cells can then be used 
to create non-human transgenic animals in which exogenous AGR2 sequences 
have been introduced into their genome or homologous recombinant animals in 
30 which endogenous AGR2 sequences have been altered. Such animals are useful 
for studying the function and/or activity of AGR2 protein and for identifying 
and/or evaluating modulators of AGR2 protein activity. As used herein, a 
"transgenic animal" is a non-human animal, preferably a mammal, more 
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preferably a rodent such as a rat or mouse, in which one or more of the cells of the 
animal includes a transgene. Other examples of transgenic animals include non- 
human primates, sheep, dogs, cows, goats, chickens, amphibians, etc. Standard 
methods are known in the art that may be used in conjunction with the 
5 polynucleotides and of the invention and methods described herein to produce a 
transgenic animal expressing a modified AGR2 of the invention. 

Methods of Screening for Desease-Relevant AGR2 Alleles 

In one aspect, the present invention relates to a method of 

10 identifying a protein or nucleic acid marker indicative of an increased risk of a 
human subject of developing a medical condition associated with an alteration in 
goblet cell function, said method comprising the step of analyzing a test sample 
derived from a human subject for the presence of a difference compared to a 
similar test sample if derived from a human subject unaffected by or known not to 

15 be at risk of developing said condition, wherein said difference is indicative of the 
presence of a mutation in an allele of the gene coding for the AGR2 protein 
according to SEQ ID NO:4, or in an allele of a gene coding for a protein which 
affects expression or function of said AGR2 protein. 

The present invention furthermore relates to a method of 

20 identifying a protein or nucleic acid marker indicative of an association of a 
medical condition in a human subject which is associated with an alteration in 
goblet cell function with altered AGR2 expression or function, said method 
comprising the step of analyzing a test sample derived from a human subject for 
the presence of a difference compared to a similar test sample if derived from a 

25 human subject unaffected by or known not to be at risk of developing said 
condition, wherein said difference is indicative of the presence of a mutation in an 
allele of the gene coding for the AGR2 protein according to SEQ ID NO:4, or in 
an allele of a gene coding for a protein which affects expression or function of 
said AGR2 protein. 

30 In the above methods, the test sample derived from a human 

subject may be directly obtained from said human subject. It may, however, also 
be a sample that has been obtained previously. Also included test samples 
according to the invention are, for example, cDNA preparations that have been 
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prepared from mRNA obtained from a tissue sample from a human subject at an 
earlier stage. It may also be cloned or PCR-amplified DNA that originates from 
DNA contained in such tissue sample obtained at an earlier stage. 

According to the claimed method, the test sample will be analyzed 

5 for a difference to a similar test sample derived from a human subject unaffected 
by or known not to be at risk of developing a medical condition associated with an 
alteration in goblet cell function. While the method may include actually deriving 
or directly obtaining a test sample from such a human subject for comparative 
purposes, the necessary information regarding the relevant structural features and 

10 properties of such similar test sample to be used for comparison will often already 
be available. Thus, it will often be sufficient for the purposes of the above 
methods of the invention to perform an analysis for a difference to a similar test 
sample as it would be observed if said similar test sample were in fact obtained 
from a human subject unaffected by or known not to be at risk of developing the 

15 above medical condition. 

The test sample may be a nucleic acid sample, e.g., mRNA (or 
cDNA derived therefrom), or genomic DNA. 

It may also be a protein sample. 

The difference analyzed may be one relating to the expression level 
20 of said nucleic acid or protein. Alternatively, it may be analyzed whether there is a 
difference in terms of the nucleotide or the amino acid sequence level. 

Accordingly, the above methods of the invention include 
embodiments wherein the step of analysis for differences between the test samples 
comprises the partial or complete determination of the sequence of the nucleic 
25 acid, or a PCR-amplified portion of the nucleic acid, of the test sample, and 
optionally also of the nucleic acid or at PCR-amplified portion of the nucleic acid 
of the similar test sample (or the similar test samples). 

Suitable methods for the determination of partial or complete 
nucleic acid sequences, and thus, detection of the above-mentioned differences, 
30 are well known to the skilled artisan. They include, for example, Southern 
blotting, TGGE (temperature gradient gel electrophoresis), DGGE (denaturing 
gradient gel electrophoresis), SCCP (single chain conformation polymorphism) 
detection, and the like. High throughput sequence analysis methods such as those 
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described by Kristensen et al (Kristensen et al., BioTechniques 30 (2001), 318- 
332), which is incorporated herein by reference in its entirety, are likewise 
suitable, and hence, contemplated in connection with the present invention. 

Suitable methods for the determination of partial or complete 
5 amino acid sequences are likewise well known, and include, for example, 
detection of particular epitopes within a protein sample via specific antibodies in 
dot blot, slot blot, or Western blot assays, or via ELISAs or RIAs, or partial amino 
acid sequence determination on a sequencer via Edman degradation. Also, high- 
throughput methods may again be employed. 

A further aspect of the present invention is represented by a 
method for identifying a predisposition of a human subject for developing, a 
medical condition associated with an alteration in goblet cell function, said 
method comprising the step of determining whether a test sample derived from 
said human subject indicates the presence of a mutation in an allele of the gene 
coding for the AGR2 protein according to SEQ ID NO:4 indicative of an 
increased risk of said human subject of developing said medical condition. 

Also contemplated in connection with the present invention is a 
method for determining whether a medical condition in a human subject which is 
associated with an alteration in goblet cell function is associated with altered 
AGR2 expression or function, said method comprising the step of determining 
whether a test sample derived from said human subject indicates the presence of a 
mutation in an allele of the gene coding for the AGR2 protein according to SEQ 
ID NO:4 indicative of an altered AGR2 expression or function. 

As in the case of the methods described above, while the methods 
described in the two preceding paragraphs may involve that the test sample is 
derived from the human subject directly, it may also be a sample that has been 
obtained previously. Furthermore, suitable test samples according to the invention 
are, for example, cDNA preparations that have been prepared from mRNA 
obtained from a tissue sample from a human subject at an earlier stage. It may also 
again be cloned or PCR-amplified DNA that originates from DNA contained in 
such tissue sample obtained at an earlier stage. 

Again, the previously mentioned methods of determining partial or 
complete nucleic acid or amino acid sequences may be employed for the step of 
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determining whether the test sample (which may be a nucleic acid or protein test 
sample as previously defined) indicates the presence of said mutation. 

According to the above methods of identifying a predisposition in a 
human subject of developing a medical condition associated with an alteration in 

5 goblet cell function, or determining a potential association between such a medical 
condition with altered AGR2 expression or function, the test sample is analyzed 
for the presence of a mutation in an allele of the AGR2 gene which is either 
indicative of an increased risk of developing such a medical condition, or of an 
altered AGR2 expression or function. It will be appreciated that such mutations 

10 are inter alia those referred to herein in connection with the proteins and nucleic 
acids according to the invention, and that mutations of this kind may be readily 
identified, for example, by the in vitro assays or the animal model referred to in 
this regard. They may also be identified by any of the afore-mentioned methods of 
screening for disease-relevant AGR2 alleles. 

15 

Pharmaceutical Compositions 

The invention also includes pharmaceutical compositions 
containing agents that can modulate AGR2 activity, i.e., AGR2 mutein or wild 
type activity. These agents include biomolecules such as proteins, muteins, 

20 kinases, phosphatases, antibodies, antibody fragments, nucleic acids, ribozymes, 
anticalins, and aptamers as described herein, as well as pharmaceutical 
compositions containing antibodies to them (e.g., antibodies to muteins or wild- 
type proteins,, anti-idotypic antibodies). In addition, the agent may also include 
chemical compounds, e.g., small molecule agonists or antagonists, that may affect 

25 AGR2 directly. Furthermore, the agents may be biomolecules and chemical 
compounds, such as the ones listed above or below, that affect the interaction 
between AGR2, i.e., AGR2 mutein or wild type protein, and its physiologic 
ligands, including the cell membrane. 

The compositions are preferably suitable for internal use and 

30 include an effective amount of a pharmacologically active compound of the 
invention, alone or in combination, with one or more pharmaceutically acceptable 
carriers. The compounds are especially useful in that they have very low, if any 
toxicity. 
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The agents of this invention, and antibodies thereto, may be used in 
pharmaceutical compositions, when combined with a pharmaceutically acceptable 
carrier. As used herein, "pharmaceutically acceptable carrier" is intended to 
include any and all solvents, dispersion media, coatings, antibacterial and 
5 antifungal agents, isotonic and absorption delaying agents, and the like, 
compatible with pharmaceutical administration. Suitable carriers are described in 
the most recent edition of REMINGTON'S PHARMACEUTICAL SCIENCES (18th ed.), 
Alfonso R. Gennaro, ed. (Mack Publishing Co., Easton, PA 1990), a standard 
reference text in the field, which is incorporated herein by reference. Preferred 
10 examples of such carriers or diluents include, but are not limited to, water, saline, 
finger's solutions, dextrose solution, and 5% human serum albumin. Liposomes 
and non-aqueous vehicles such as fixed oils may also be used. The use of such 
media and agents for pharmaceutically active substances is well known in the art. 
Except insofar as any conventional media or agent is incompatible with the active 
15 compound, use thereof in the compositions is contemplated. Supplementary active 
compounds can also be incorporated into the compositions. 

A pharmaceutical composition of the invention is formulated to be 
compatible with its intended route of administration. Examples of routes of 
administration include parenteral, e.g., intravenous, intradermal, subcutaneous, 
20 oral (e.g., inhalation), transdermal (i.e., topical), transmucosal, and rectal 
administration. Solutions or suspensions used for parenteral, intradermal, or 
subcutaneous application can include the following components: a sterile diluent 
such as water for injection, saline solution, fixed oils, polyethylene glycols, 
glycerine, propylene glycol or other synthetic solvents; antibacterial_agents_ such 
25 as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or 
sodium bisulfite; chelating agents such as ethylenediaminetetraacetic acid 
(EDTA); buffers such as acetates, citrates or phosphates, and agents for the 
adjustment of tonicity such as sodium chloride or dextrose. The pH can be 
adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. The 
30 parenteral preparation can be enclosed in ampoules, disposable syringes or 
multiple dose vials made of glass or plastic. 

Pharmaceutical compositions suitable for injectable use include 
sterile aqueous solutions (where water soluble) or dispersions and sterile powders 
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for the extemporaneous preparation of sterile injectable solutions or dispersion. 
For intravenous administration, suitable carriers include physiological saline, 
bacteriostatic water, Cremophor EL™ (BASF, Parsippany, NJ, U.S.A.) or 
phosphate buffered saline (PBS). In all cases, the composition must be sterile and 
5 should be fluid to the extent that easy syringeability exists. It must be stable under 
the conditions of manufacture and storage and must be preserved against the 
contaminating action of microorganisms such as bacteria and fungi. The carrier 
can be a solvent or dispersion medium containing, for example, water, ethanol, 
polyol (for example, glycerol, propylene glycol, and liquid polyethylene glycol, 

10 and the like), and suitable mixtures thereof. The proper fluidity can be maintained, 
for example, by the use of a coating such as lecithin, by the maintenance of the 
required particle size in the case of dispersion and by the use of surfactants. 
Prevention of the action of microorganisms can be achieved by various 
antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, 

15 ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to 
include isotonic agents, for example, sugars, polyalcohols such as manitol, 
sorbitol, and sodium chloride in the composition. Prolonged absorption of the 
injectable compositions can be brought about by including in the composition an 
agent which delays absorption, for example, aluminum monostearate and gelatin. 

20 For instance, for oral administration in the form of a tablet or 

capsule (e.g., a gelatin capsule), the active drug component can be combined with 
an oral, non-toxic phannaceutically acceptable inert carrier such as ethanol, 
glycerol, water and the like. Moreover, when desired or necessary, suitable 
binders, lubricants, disintegrating agents and coloring agents can also be 

25 incorporated into the mixture. Suitable binders include starch, magnesium 
aluminum silicate, starch paste, gelatin, methylcellulose, sodium 
carboxymethylcellulose and/or polyvinylpyrrolidone, natural sugars such as 
glucose or beta-lactose, corn sweeteners, natural and synthetic gums such as 
acacia, tragacanth or sodium alginate, polyethylene glycol, waxes and the like. 

30 Lubricants used in these dosage forms include sodium oleate, sodium stearate, 
magnesium stearate, sodium benzoate, sodium acetate, sodium chloride, silica, 
talcum, stearic acid, its magnesium or calcium salt and/or polyethyleneglycol-and ■- 
the like. Disintegrators include, without limitation, starch, methyl cellulose, agar, 
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bentonite, xanthan gum starches, agar, alginic acid or its sodium salt, or 
effervescent mixtures, and the like. Diluents, include, e.g., lactose, dextrose, 
sucrose, mannitol, sorbitol, cellulose and/or glycine. 

Injectable compositions are preferably aqueous isotonic solutions 
5 or suspensions, and suppositories are advantageously prepared from fatty 
emulsions or suspensions. The compositions may be sterilized and/or contain 
adjuvants, such as preserving, stabilizing, wetting or emulsifying agents, solution 
promoters, salts for regulating the osmotic pressure and/or buffers. In addition, 
they may also contain other therapeutically valuable substances. The compositions 
10 are prepared according to conventional mixing, granulating or coating methods, 
respectively, and contain about 0.1 to 75%, preferably about 1 to 50%, of the 
active ingredient. 

The compounds of the invention can also be administered in such 
oral dosage forms as timed release and sustained release tablets or capsules, pills, 
powders, granules, elixers, tinctures, suspensions, syrups and emulsions. 

Liquid, particularly injectable compositions can, for example, be 
prepared by dissolving, dispersing, etc. The active compound is dissolved in or 
mixed with a pharmaceutically pure solvent such as, for example, water, saline, 
aqueous dextrose, glycerol, ethanol, and the like, to thereby form the injectable 
solution or suspension. Additionally, solid forms suitable for dissolving in liquid 
prior to injection can be formulated. Injectable compositions are preferably 
aqueous isotonic solutions or suspensions. The compositions may be sterilized 
and/or contain adjuvants, such as preserving, stabilizing, wetting or emulsifying 
agents, solution promoters, salts for regulating the osmotic pressure and/or 
buffers. In addition, they may also contain other therapeutically valuable 
substances. 

The compounds of the present invention can be administered in 
intravenous (both bolus and infusion), intraperitoneal, subcutaneous or 
intramuscular form, all using forms well known to those of ordinary skill in the 
pharmaceutical arts. Injectables can be prepared in conventional forms, either as 
liquid solutions or suspensions. 

Parental injectable administration is generally used for 
subcutaneous, intramuscular or intravenous injections and infusions. Additionally, 
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one approach for parenteral administration employs the implantation of a slow- 
release or sustained-released system, which assures that a constant level of dosage 
is maintained, according to US Pat. No. 3,710,795, incorporated herein by 
reference. 

5 Furthermore, preferred compounds for the present invention can be 

administered in intranasal form via topical use of suitable intranasal vehicles, or 
via transdermal routes, using those forms of transdermal skin patches well known 
to those of ordinary skill in that art. To be administered in the form of a 
transdermal delivery system, the dosage administration will, of course, be 
10 continuous rather than intermittent throughout the dosage regimen. Other 
preferred topical preparations include creams, ointments, lotions, aerosol sprays 
and gels, wherein the concentration of active ingredient would range from 0.1% to 
15%, w/worw/v. 

For solid compositions, excipients include pharmaceutical grades 
15 of mannitol, lactose, starch, magnesium stearate, sodium saccharin, talcum, 
cellulose, glucose, sucrose, magnesium carbonate, and the like may be used. The 
active compound defined above, may be also formulated as suppositories using for 
example, polyalkylene glycols, for example, propylene glycol, as the carrier. In 
some embodiments, suppositories are advantageously prepared from fatty 
20 emulsions or suspensions. 

The compounds of the present invention can also be administered 
in the form of liposome delivery systems, such as small unilamellar vesicles, large 
unilamellar vesicles and multilamellar vesicles. Liposomes can be formed from a 
variety of phospholipids, containing cholesterol, stearylamine or 
phosphatidylcholines. In some embodiments, a film of lipid components is 
hydrated with an aqueous solution of drug to a form lipid layer encapsulating the 
drug, as described in US Pat. No. 5,262,564. 

Compounds of the present invention may also be delivered by the 
use of monoclonal antibodies as individual carriers to which the compound 
molecules are coupled. The compounds of the present invention may also be 
coupled with soluble polymers as targetable drug carriers. Such polymers can 
include polyvinylpyrrolidone, pyran copolymer, polyhydroxypropyl- 
methacrylamide-phenol, polyhydroxyethylaspanamidephenol, or 
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polyethyleneoxidepolylysine substituted with palmitoyl residues. Furthermore, the 
compounds of the present invention may be coupled to a class of biodegradable 
polymers useful in achieving controlled release of a drug, for example, polylactic 
acid, polyepsilon caprolactone, polyhydroxy butyric acid, polyorthoesters, 
5 polyacetals, polydihydropyrans, polycyanoacrylates and cross-linked or 
amphipathic block copolymers of hydrogels. 

If desired, the pharmaceutical composition to be administered may 
also contain minor amounts of non-toxic auxiliary substances such as wetting or 
emulsifying agents, pH buffering agents, and other substances such as for 
10 example, sodium acetate, triethanolamine oleate, etc. 

The dosage regimen utilizing the compounds is selected in 
accordance with a variety of factors including type, species, age, weight, sex and 
medical condition of the patient; the severity of the condition to be treated; the 
route of administration; the renal and hepatic function of the patient; and the 
15 particular compound or salt thereof employed. An ordinarily skilled physician or 
veterinarian can readily determine and prescribe the effective amount of the drug 
required to prevent, counter or arrest the progress of the condition. 

Oral dosages of the present invention, when used for the indicated 
effects, may be preferably provided in any form commonly used for oral dosage 
20 such as, for example, in scored tablets, time released capsules, liquid filled 
capsule, gels, powder or liquid forms. When provided in tablet or capsule form, 
the dosage per unit may be varied according to well known techniques. For 
example, individual dosages may contain 0.5, 1.0, 2.5, 5.0, 10.0, 15.0, 25.0, 50.0, 
100.0, 250.0, 500.0 and 1000.0 mg of active ingredient. It is well known that daily 
25 dosage of a medication, such as a medication of this invention, may involve 
between one to ten or even more individual tables per day. 

The compounds comprised in the pharmaceutical compositions of 
the present invention may be administered in a single daily dose, or the total daily 
dosage may be administered in divided doses of two, three or four times daily. 
30 Any of the above pharmaceutical compositions may contain 0.1- 

99%, preferably 1-70% (w/w or w/v) of the wild type AGR2 polypeptide, the 
proteins and fragments, or the antibodies and their various modified embodiments 
specifically described and claimed herein. 
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If desired, the pharmaceutical compositions can be provided with 
an adjuvant. Adjuvants are discussed above. In some embodiments, adjuvants can 
be used to increase the immunological response, depending on the host species, 
include Freund's (complete and incomplete), mineral gels such as aluminum 
5 hydroxide, surface active substances such as lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, keyhole limpet hemocyanin, dinitrophenol, 
and potentially useful human adjuvants such as BCG (bacille Calmette-Guerin) 
and Corynebacterium parvum. Generally, animals are injected with antigen using 
several injections in a series, preferably including at least three booster injections. 

10 

Gene Therapy 

A further aspect of the present invention is a method of gene 
therapy comprising delivering to cells in a human subject suffering from or known 
to be at risk of developing a condition associated with an alteration in goblet cell 

15 function a DNA construct comprising a sequence of an allele of the AGR2 gene 
encoding the human AGR2 protein according to SEQ ID NO:4, or encoding a 
protein having at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99% 
amino acid identity compared to the mouse Agr2 or the human AGR2 protein 
according to SEQ ID NO:3 and SEQ ID NO:4, respectively; or a sequence of an 

20 allele of the AGR2 gene of a human subject unaffected by or known not to be at 
risk of developing said conditioa 

Also encompassed by the present invention is a method of gene 
therapy of the above kind wherein the DNA construct delivered to the cells of the 
human subject comprises a DNA sequence encoding the human AGR2 protein 

25 according to SEQ ID NO:4, or a human AGR2 protein encoded by the AGR2 
gene of a human subject unaffected by or known not to be at risk of developing 
said condition, or a protein having at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 
98%, or 99% amino acid identity compared to the mouse Agr2 or the human 
AGR2 protein according to SEQ ID NO:3 and SEQ ID NO:4, respectively. 

30 Furthermore encompassed are methods wherein the DNA construct 

comprises a DNA sequence encoding an antisense nucleic acid according to the 
invention, or an antisense nucleic acid comprising a nucleotide sequence which is 
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complementary to an mRNA encoded by the AGR2 gene of a human subject 
unaffected by or known not to be at risk of developing said condition. 

Also encompassed are methods wherein the DNA construct 
comprises a DNA sequence encoding an siRNA as described and claimed herein. 
5 Alternatively, the DNA construct may comprise a DNA encoding 

an aptamer specifically binding an AGR2 mutein or an AGR2 wild type protein as 
described herein. 

In a further embodiment, the DNA construct may comprise a DNA 
sequence encoding an Agr2 mutein as described herein. 

10 The use of a DNA construct as described above in a method of 

treating a human subject suffering from, or known to be at risk of developing a 
medical condition associated with an alteration in goblet cell function, said 
method comprising delivering said DNA construct to at least some of the cells of 
said human subject, preferably the subject's goblet cells, is also encompassed 

15 within the present invention. 

Method of Modulating AGR2 Activity and Corresponding Uses 

A further aspect of the present invention is a method of preventing, 
treating, or ameliorating a medical condition in a human subject associated with 

20 an alteration in goblet cell fiinction, said method comprising administering to said 
human subject a pharmaceutical composition comprising an agent capable of 
modulating AGR2 activity, i.e., AGR2 mutein or wild type activity, in said human 
subject. The medical condition associated with an alteration in goblet cell function 
as described above and throughout the present description may optionally be 

25 furthermore associated with an increase in proliferation of the glandular 
epithelium of the Brunner's gland. 

The medical conditions may be associated with a decreased mucus 
production, e.g., dry eye syndrome, gastric disease, peptic ulcer, inflammatory 
bowel disease, in particular Crohn's disease or ulcerative colititis, or intestinal 

30 cancer. 

Alternatively, the medical conditions may be associated with an 
increase in mucus production^ejg^asthma, chronic obstructive pulmonary disease 
(COPD), and cystic fibrosis. 

76 



WO 2004/056858 



PCT/EP2003/014834 



The agent capable of modulating AGR2 activity may be one of the 
agents described and specifically claimed herein, e.g., one of the muteins, nucleic 
acids, e.g., nucleic acids encoding the muteins, antisense nucleic acids, siRNAs or 
aptamers directed against or specifically binding to the AGR2 muteins, antibodies, 
5 or small molecule agonists or antagonists of the AGR2 muteins or wild type 
AGR2 protein as described herein. 

It will be appreciated that in situations where the above medical 
condition is caused by a mutation in one of the alleles of the AGR2 gene which 
leads to the expression of an AGR2 mutein with a reduced or abolished activity, 
antisense nucleic acids, siRNA molecules, aptamers, anticalins, or antibodies 
directed against said AGR2 mutein may be therapeutically useful. Alternatively, 
administration of an AGR2 mutein, or a nucleic acid coding therefore, which is 
characterized by an increased AGR2 activity, or administration of a nucleic acid 
capable of leading to an increased AGR2 expression (e.g., of the endogenous 
wild-type AGR2 or of a wild-type AGR2 encoded by said nucleic acid), may 
likewise be therapeutically useful in this regard. 

In situations where an excess amount or activity of the endogenous 
AGR2 protein is the cause of the above medical condition, administration of an 
AGR2 mutein, or nucleic acid coding therefore, which is characterized by a 
decreased AGR2 activity, or administration of a nucleic acid capable of leading to 
a decreased AGR2 expression (e.g., of an endogenous mutated or a wild-type 
AGR2) may likewise be therapeutically useful in this regard. 

It will be appreciated that agents relating to the wild type AGR2 
protein will likewise be advantageously administered to a human subject suffering 
from a condition as mentioned above, e.g., in situations where a reduced amount 
or activity of the endogenous AGR2 is the cause of the above medical condition in 
the human subject. Accordingly, it will be appreciated that a wild type AGR2 
protein may advantageously be administered to a human subject suffering from 
such a condition, or a protein having a certain amino acid sequence identity and 
showing the same, or essentially the same, biological activity in any of the in vitro 
assays mentioned herein before (or a fragment or fusion of such protein). Proteins 
suitable in this regard may be readily determined, e.g., with the help of these in 
vitro assays. 
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It will also be appreciated that in situations where an excess of 
endogenous AGR2 protein or activity is the cause of the medical condition in the 
human subject, antisense nucleic acids, siRNAs molecules, aptamers, anticalins, 
or antibodies against said AGR2 wild type protein, may be therapeutically used. 
5 It will be understood that the skilled person may use the in vitro 

assays as described herein in order to identify the activity of a given AGR2 mutein 
or the effect of an agent relating to such an AGR2 mutein or AGR2 wild type 
protein. Based on this information, the skilled person will be readily able to 
choose and identify the appropriate agent in connection with the disease situation 
10 to be treated. 

Assays and Diagnostics 

The animals of the present invention present a phenotype whose 
characteristics are representative of many symptoms associated with disorders of 
15 altered mucus production and/or function, therefore making the animal model of 
the present invention a particularly suitable model for the study of these diseases 
including asthma, chronic obstructive pulmonary disease (COPD), cystic fibrosis, 
dry eye syndrome, gastric disease, peptic ulcer, inflammatory bowel disease and 
malignancies like colorectal cancer. 

The animals of the present invention can also be used to identify early 
diagnostic markers for diseases associated with AGR? deficiency. The term 
deficiency refers to an alteration of protein function in both positive (= gain of 
function) and negative (= loss of function) ways. Surrogate markers, including but 
not limited to ribonucleic acids or proteins, can be identified by performing 
procedures of proteomics or gene expression analysis known in the art. For 
example procedures of proteomics analysis include, but are not restricted to, 
ELISA, 2D-gel, protein microarrays or mass spectrophotometric analysis of any 
organ or tissue samples, such as blood samples, or derivatives thereof, preferably 
plasma, at different age or stage of AGR2 activity deficiency or activity increase 
associated disease development, or symptom thereof As a further example, gene 
expression analysis procedures include, but are not restricted to, differential 
display, cDNA microarrays, analysis of quality and quantity of ribonucleic acids 
species from any organ or tissue samples, such as blood samples, or derivatives 
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thereof, at different age or stage of development of AGR2 activity deficiency 
associated disease, or symptom thereof. 

The animal model of the present invention can be used to monitor 
the activity of agents useful in the prevention or treatment of the above-mentioned 
5 diseases and disorders. The agent to be tested can be administered to an animal of 
the present invention and various phenotypic parameters can be measured or 
monitored. In a further embodiment the animals of the invention may be used to 
test therapeutics against any disorders or symptoms that have been shown to be 
associated with AGR2 deficiency or over-expression. 

10 The animals of the present invention can also be used as test model 

systems for materials, including but not restricted to chemicals and peptides, 
particularly medical drugs, suspected of promoting or aggravating the above- 
described diseases associated with AGR2 deficiency. For example, the material 
can be tested by exposing the animal of the present invention to different time, 

15 doses and/or combinations of such materials and by monitoring the effects on the 
phenotype of the animal of the present invention, including but not restricted to 
change of goblet cell function, namely proper mucin production. Furthermore, the 
animals of the present invention may be used for the dissection of the molecular 
mechanisms of the AGR2 pathway, that is for the identification of receptors or 

20 downstream genes or proteins thereof regulated by AGR2 activity and deregulated 
in AGR2 activity deficiency or activity increase associated disorders. For 
example, this can be done by performing differential proteomics analysis, using 
techniques including but not restricted to 2D gel analysis, protein chip microarrays 
or mass spectrophotometry, on tissues of the animal of the present invention 

25 which express AGR2 and which respond to AGR2 stimuli. 

An exemplary method for detecting the presence or absence of 
AGR2 mutein in a biological sample involves obtaining a biological sample from 
a test subject and contacting the biological sample with a compound or an agent 
capable of detecting AGR2 protein or nucleic acid (e.g., mRNA, genomic DNA) 

30 that encodes AGR2 mutein such that the presence of AGR2 is detected in the 
biological sample. An agent for detecting AGR2 mutein mRNA or genomic DNA 
is a labeled nucleic acid probe capable of hybridizing to AGR2 mutein mRNA or 
genomic DNA. 
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The diagnostic methods described herein can furthermore be 
utilized to identify subjects having or at risk of developing a disease or disorder 
associated with aberrant AGR2 expression or activity. For example, the assays 
described herein, such as the preceding diagnostic assays or the following assays, 

5 can be utilized to identify a subject having or at risk of developing a disorder 
associated with AGR? protein, nucleic acid expression or activity. Alternatively, 
the prognostic assays can be utilized to identify a subject having or at risk for 
developing a disease or disorder. Thus, the invention provides a method for 
identifying a disease or disorder associated with aberrant AGR2 expression or 

10 activity in which a test sample is obtained from a subject and AGR2 protein or 
nucleic acid (e.g., mRNA, genomic DNA) is detected, wherein the presence of 
AGR2 protein or nucleic acid is diagnostic for a subject having or at risk of 
developing a disease or disorder associated with aberrant AGR2 expression or 
activity. As used herein thoughout the entire specification, a "test sample" refers 

15 to a biological sample obtained from a subject of interest. For example, a test 
sample can be a biological fluid (e.g., blood, plasma, serum), cell sample, or tissue 
sample. 

Furthermore, the prognostic assays described herein can be used to 
determine whether a subject can be administered an agent (e.g., an agonist, 
20 antagonist, peptidomimetic, protein, peptide, nucleic acid, small molecule, or 
other drug candidate) to treat a disease or disorder associated with aberrant AGR2 
expression or activity. For example, such methods can be used to determine 
whether a subject can be effectively treated with an agent for a disorder. 

Agents, or modulators that have a stimulatory or inhibitory effect 
on AGR2 activity (e.g., AGR2 gene expression), as identified by a screening assay 
described herein can be administered to individuals to treat (prophylactically or 
therapeutically) AGR2-mediated disorders. Differences in metabolism of 
therapeutics can lead to severe toxicity or therapeutic failure by altering the 
relation between dose and blood concentration of the pharmacologically active 
drug. Thus, the pharmacogenomics of the individual permits the selection of 
effective agents (e.g., drugs) for prophylactic or therapeutic treatments based on a 
consideration of the individuals genotype. Such pharmacogenomics can further be 
used to determine appropriate dosages and therapeutic regimens. Accordingly, the 
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activity of AGR2 protein, expression of AGR2 nucleic acid, or mutation content 
of AGR2 genes in an individual can be determined to thereby select appropriate 
agent(s) for therapeutic or prophylactic treatment of the individual. 

The present invention also provides a diagnostic method for AGR2 
5 activity deficiency or activity increase. Patients' peptide material, particularly that 
in or from blood, serum or plasma, is subjected to analysis for one or more of the 
amino acid sequences of the present invention. The peptide material may be 
analyzed directly or after extraction, isolation and/or purification by standard 
methods. 

10 In one embodiment of the invention, the diagnostic method 

comprises the identification of the modified AGR2, whereby the modification is 
associated with the replacement of an amino acid at a position corresponding to 
position 137 in the amino acid sequence shown in SEQ ID NO:4. The diagnostic 
methods of the invention also include those employing detection of the modified 

15 AGR2 by its activity in competing with and blocking the action of native AGR2. 
Methods of identifying the modified AGR2 include any methods known in the art 
which are able to identify altered conformational properties of the amino acid 
sequence of the present invention compared to those of the wild type AGR2. 
These include, without limitation, the specific recognition of the modified protein 

20 by other proteins, particularly antibodies; individual or combined patterns of 
amino acid sequence digestion by known proteases or chemicals. In an additional, 
similar embodiment, the method exploits the failure of another protein to 
recognize the modified protein, examples being antibodies directed to an epitope 
of wild type AGR2 that incorporates residue 137 of SEQ ED NO:4, and AGR2 

25 receptors in which this portion of the molecular surface of wild type AGR2 is 
recognized or involved in AGR2. 

In a further embodiment of the present invention, the principle of 
the diagnostic method is the detection of a nucleic acid sequence encoding the 
modified AGR2 of the invention. This includes, but is not restricted to any 

30 methods known in the art using nucleic acid hybridizing properties, such as 
Polymerase Chain Reaction (PCR), Northern blot, Southern blot, nucleic acid 
(genomic DNA, cDNA, mRNA, synthetic oligonucleotides) standard methods 
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employing microarrays, and patterns of nucleic acid digestion by known 
restriction enzymes. 

The invention will be further described in the following examples, 
which do not limit the scope of the invention described in the claims. The 
5 following examples are offered for illustrative purposes only, and are not intended 
to limit the scope of the present invention in any way. 

Other features and advantages of the invention will be apparent from the 
following examples. 

Example 1: ENU (Ethyl-nitroso-urea) Treatment to Produce Mutagenized 
10 Animals 

To produce mutants, a C3HeB/FeJ male mouse (The Jackson 
Laboratory, Bar Harbor ME, U.S.A.) was injected intraperitoneally three times 
(weekly intervals between 8-10 weeks of age) with ethyl-nitroso-urea (ENU) 
(Serva Electrophoresis GmbH, Heidelberg, Germany) at a dosage of 90mg/kg 
15 body weight. The injected male mouse was regularly mated to wild type 
C3HeB/FeJ female partners fifty days after the last injection. The resultant Fl 
progeny (up to 100 offspring) were then analyzed for dominant phenotypes. 

Generation of F3 Progeny - Breeding Scheme 

F3 progeny are generated using the breeding scheme shown in 
20 Figure 3A. All breeding partners were older than 8 weeks); preferably females 
were between 8-12 weeks of age and males were between 8-16 weeks of age. 

Production of Fl-animals fdbl) 

Each ENU-male produced as described above is used to generate 
more than 30 male and 30 female pups, which were interbred as described below. 

25 Production of F2-animals (rfl) 

Each week, 20 matings are set up as follows: (1 male Fl(dbl) x 1 
female Fl(dbl) to produce 20 pedigrees. The animals of one breeding pair are 
pups of different ENU-animals (mating type: rfl). 
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Production of F3 -animals (rbs) 

8 weeks rfl animals are mated in single F2 (1 male) x F2 (1 
female) - breeding per pedigree (mating type: rbs). From each rbs-breeding, at 
least 15 offspring are produced. Rfl -females are kept until the youngest rbs 
5 animals have been screened (age =160 days). Rfl -males are sacrificed and frozen 
after the number of 15 offspring has been reached. F3 animals are analyzed in the 
primary screen. 

We performed a series of tests on F3 animals as a primary screen to 
identify relevant phenotypes. For this invention, observation of diarrhea and 
10 results of a routine histological examination provided information to identify an 
aberrant phenotype within the F3 population. 

Example 2: Physiological Characteristics of the Mutant Animals 

The macroscopic evaluation indicates that 100% of the 
homozygous MTZ offspring in a C3H inbred background developed a 
15 macroscopically visible diarrhea and a thriving deficit Thriving deficit is 
manifested in reduced weight in combination with reduced body length, when 
compared to wild type littermates. 

Example 3: Necroscopy and Organ Histology of the Mutant Animals 

The visible diarrhea and the thriving deficit led to the subsequent 

20 investigation of the intestinal organs of the MTZ mouse. 

For example, a histological examination of hematoxilin/eosin 
stained (Figure 1 1), or of lectin stained (Figure 13) colon wall sections from MTZ 
affected animals depict a strong reduction in pre-mucin storing granules in goblet 
cells, resulting in reduced mucus secretion and secondary inflammatory 

25 infiltrations in the colon mucosal epithelium and submucosa (marked by an 
asterisk in Figure 12). Additionally, microerosion of colonic mucosa is detectable 
(marked by an arrow in Figure 12). Paneth cells and enterochromafEn cells are not 
affected. 

The observation that absence of normal XGR2 protein leadTT<r"~ 
30 dysfunctional goblet cells is be to extended to other mucosal organs expressing 
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murine Agr2 mRNA, such as the eye, nose, trachea, lung, esophagus, salivary 
gland, stomach, intestine, rectum, thymus, testis, epididymis, uterus and placenta, 
as determined by RT-PCR, and as described in Example 6, and as shown in Figure 
6. Northern analysis of human mRNA confirmed the expression of Agr2 mRNA 
5 in all goblet cell carrying tissues and organs of the gastrointestinal tract, of the 
respiratory tract and in prostate and cervix, as described in Example 8, and as 
shown in Figure 8. 

In addition to the goblet cell phenotype described for MTZ colon, affected 
mice display a dilated Brunner's gland with increased proliferating glandular 
10 epithelium. Duodenal epithelium closely located to the Brunner's gland is 
characterized by loss of goblet cells, proliferated epithelium and signs of slight 
inflammation, as shown in Figure 14. AGR2 mRNA expression in Brunner's 
glands was detected by RNA in ^-hybridization technique. 

Example 4: Mapping and Cloning of the Mutation in the Mutant Animals 
15 of the Present Invention 

1- Generation of F5 Outcross Mice for Subsequent Chromosome Mapping 

F5 progeny are generated according to the scheme illustrated in 
Figure 3B - this entails breeding a phenotypically identified F3 mutant with 
C57B1/6 mice for generation of F4 outcross mice. F4 progeny are then 
20 intercrossed to produce an F5 generation. The F5 generation is phenotyped 
according to the previously described parameters. Starting with two F3 animals of 
the MTZ pedigree we generated 40 F4 animals (22 males, 1 8 females) and 236 F5 
animals (115 males, 121 females). The F5 outcross mice were used to locate the 
MTZ phenotype causing ENXJ mutation in the mouse genome. 

25 2. DNA Isolation from Rodent Tails 

Mouse genomic DNA was purified from 1 cm long pieces of mice 
tail by using the "DNeasy 96 Tissue Kit" (Qiagen, Hilden, Germany) according to 
the manufacturer's protocol. 
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3. Macromapping 

In F5 outcross mice allele frequencies of C57BI/6 versus C3H 
alleles are 1:1 in average, following Mendelian rules of inheritance. Arrangement 
in groups of phenotypic positive and phenotypic negative mice alters this ratio 
5 only at marker positions in the vicinity of the phenotype causing mutation driving 
it towards 0:1 in the phenotypic positive group and 1:0 in the phenotypic negative 
group. Allele frequency analysis of distributed genome covering markers (e.g., 
SSLP, SNP) in a group of phenotype positive F5 outcross mice indicate the site of 
the mutation as values for the C3H:C57Bl/6: ratio increase above 3. 

For the MTZ mice we analyzed for a chromosomal locus with 
increased allele frequency for single nucleotide polymorphisms (SNPs) 
representing the C3H strain. Markers in this analysis are 90 SNPs polymorphic 
between C3H and C57B1/6 strains, equally distributed over the 19 autosomal 
mouse chromosomes. Analysis was done in two steps at pooled tail DNA samples 
of 14 F5 outcross mice positive for the MTZ phenotype. First: competitive PCR, 
followed by second: SNP allele frequency measurement from the PCR product 
mix by Pyrosequencing technology (PSQ 96 system; http://www.pyrosequen- 
cing.com/pages/applications.html). 

Pooled tail DNA (1ml 10^g/ml: 10|xg/14 mice= 0.71ng/mouse 
(concentration roughly judged and adjusted by agarose gel comparison to 
standard), pooled, ad 1ml) was distributed in a 96-well plate with predeposited 
SNP marker PCR primers (one SNP/well). A standard PCR reaction was 
performed (50|nl vol.). One of both SNP primers was biotinylated, which is 
necessary for the subsequent single strand PCR product purification in the 
Pyrosequencing procedure. Purification of a single stranded (ss) PCR product and 
short range sequencing the SNP positions on the ss PCR product was performed 
according to the instructions supplied with the Pyrosequencing kit (PSQ 96 SNP 
Reagent Kit, 5x96). The resulting peaks at the polymorphic bp positions of the 
SNP sequence correlate to the amount this allele had in the original DNA pool and 
were exported from the PSQ 96 databank and processed into an Excel macro. 

The Excel macro calculated the C3H/BL6-peakhight ratio at every 
SNP position according to the formula: 

(peaHiight C3 VakW^ Constant individuflJSNP serves to 
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improve C3H/BL6-peakhight ratio comparability among different SNP positions 
and is an average value for pea]diight a %eakhight BL6 of a heterozygous 
C3H/C57B1/6 mouse (Fl outcross mouse). This value was determined 
experimentally afore for every individual SNP from nine (triplicates on three 

5 days) measurements and is expected to be close to 1 in theory but often differs 
from 1 in practice. Finally the Excel macro delivered a graphical output from the 
calculated B16/C3H-peakhight ratios (Figure 4) in which regions with values 
above 3 indicate the chromosomal position of the mutation. 

The output for MTZ phenotype positive DNA pool analysis 

10 showed high values above 3 at chromosome 12 and assigned the mutation to 
chromosome 12, 0-30 cM. 

4. Fine Mapping 

The initial mapping was confirmed on single mouse level 
haplotype analysis of a total of 236 F5 outcross MTZ mice using microsatellite 
15 markers located in the critical region on chromosome 12. Successively the 
candidate region mapping was refined, based on mice that carry chromosomal 
break points in the respective region. Finally the analysis narrowed the location of 
the mutation to an interval of approximately 25.7 Mbp between the SNP marker 
Idb2 (SEQ ID No:ll, primer SEQ ID No:12, 13) and D12Mit64 (SEQ ID No:14, 
20 primer SEQ ID No: 15, 16). This was evident since MTZ mouse #764 (phenotype 
positive) excluded the region proximal of Idb2, while MTZ mice #799 and #899 
(both phenotype positive) excluded the region distal of D12Mit64 (Figure 5). This 
results into the conclusion that a gene located entirely or partially between these 
markers could contain the mutation. 
25 The genomic interval between markers Idb2 and D12Mit64 was 

scanned for genes by a detailed analysis of public mouse and human genome 
databases. Several annotated mouse genes were recorded within this region. Of 
these, the identified AGR2 gene was considered one of the most relevant 
candidate genes to search for the mutation, as it was known to be expressed in 
goblet cells. 
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5. PCR Amplification and Sequencing o f Mouse Agr2 Gene 

The genomic structure, precise location of AGR2 exons and a 
putative fiill length cDNA (SEQ ID No:6), containing the open reading frame 
coding for the AGR2 protein (SEQ ID No:3), an poly adenylation signal, and a 

5 polyA stretch was deduced from a public available mouse Agr2 cDNA sequence 
(Genbank accession number NM_011783) and from genomic mouse DNA data 
(Ensemble, Feb 2002 freeze of the mouse assembly). The same was done for 
human AGR2 (Genbank accession number NM_006408). For mouse Agr2, 8 
exons could be defined (see Figure IB) that very closely resemble the human 

10 AGR2 gene in respect to size, sequence, genomic context and chromosomal exon 
distribution, suggesting evolutionary conserved functions for mouse and human 
AGR2 (see Figure 1). 

Genomic DNA fragments of AGR2 gene were obtained by PCR 
using BioTherm-DNA-polymerase (GeneCraft, Germany) according to the 

15 manufacturer's protocol. Oligonucleotide primers were designed using a publicly 
available primer design program (Primer 3, www.genome.wo.mit.edu) to generate 
a series of oligonucleotide primers specific for AGR2 exons. Primers used for 
amplification are shown in SEQ ID NO:17 to SEQ ID NO:28. (Primers SEQ ID 
No: 17 and 18 were used to amplify exon 2, SEQ ID NO: 19 and 20 were used to 

20 amplify exon 3+4, SEQ ID NO:21 and 22 were used to amplify exon 5, SEQ ID 
NO:23 and 24 were used to amplify exon 6, SEQ ID NO:25 and 26 were used to 
amplify exon 7, SEQ ID NO:27 and 28 were used to amplify exon 8, exon 1 was 
not sequenced, since it is a noncoding exon). PCR amplified products were 
purified using the QIAquick PCR Purification Kit (Qiagen, Hilden, Germany) 

25 according to the manufacturer's protocol. PCR products were sequenced using 
forward/reverse PCR primers and the "Big Dye" thermal cycle sequencing Kit 
(ABI PRISM, Applied Biosystems, Foster City, CA, U.S.A.). The reaction 
products were analyzed on an ABI 3700 DNA sequencing device. 

6. Sequence Analysis 

30 The sequences were edited manually and different sequence 

fragments were assembled into one contiguous sequence the software Sequencer 
version 4.0.5. (Gene Codes Corp., Ann Arbor MI, U.S.A.). We sequenced the 

87 



WO 2004/056858 



PCT/EP2003/014834 



AGR2 gene in MTZ phenotype positive homozygous F2 outcross mice as well as 
heterozygous mice . In both cases, C3H and C57B1/6 mice sequences were used as 
controls. The sequencing results showed that exons 2-6 and exon 8 were free of 
any mutation. However, a single bp exchange in exon 7 changing the underlined T 
5 in sequence ATCCCTGACGGTGAGGGCAGAC (see SEQ ID NO:6) to A (see 
SEQ ID NO:l), resulting in an A/T double peak in the heterozygous mice and a 
pure A in the homozygous MTZ mice. The mutation was confirmed in all MTZ 
phenotype positive mice tested. Sequencing the coding region from other genes in 
the candidate region showed that those were free of any additional mutation. 
10 As a consequence of the identified mutation the codon GTG is 

changed to GAG and the mutated AGR2 protein carries a charged glutamic acid 
(E) in position 137 instead of the non polar valin (V) in the wild type (non 
mutated) protein. 

Example 5 Method for Production of the Mutant Animals of the Present 
15 Invention by Gene Targeting Technology. 

The construction of a recombinant targeting vector to insert a point 
mutation in exon 7 of the mouse Agr2 gene may be performed according to well 
known techniques. For example the Lambda-KO-Sfi system of Nehls and Wattler, 
WO 01/75127. 

20 1. Vector Construction 

In a first step, a l,5kbp genomic DNA fragment is PCR amplified, 
representing the left arm of homology of the targeting vector to be constructed. 
After subsequent subcloning of the PCR fragment into a plasmid vector, i.e. pCR 
2.1-TOPO (K4500-01, Invitrogen, Carlsbad, California, USA), according to the 

25 manufacturer's instructions, plasmid DNA, bearing the correct AGR2 insert is 
subject to site-directed mutagenesis, using a QuickChange Site-Directed 
Mutagenesis Kit (200518, Stratagene, La Jolla, California, USA), as outlined in 
the manufacturer's instructions. In brief, the plasmid vector (parental DNA 
template) and two oligonucleotide primers, each primer complementary to 

30 opposite strands of the vector insert and containing the desired point mutation 
(exon 7, position 462 of AGR2 cDNA), are denatured and subject to PCR 
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amplification with a proof-reading DNA polymerase (Pfii Turbo), provided in the 
kit. Using the non-strand displacing action of Pfu Turbo DNA polymerase, 
mutagenic primers are incorporated and extended, resulting in nicked circular 
DNA strands. In a restriction digest with Dpnl, only the methylated parental DNA 

5 template is susceptible to Dpnl digestion. After transformation in XL 1 -Blue 
supercompetent cells, provided with the kit, nicks in the mutated (point mutation) 
plasmid DNA are repaired. Mutation positive colonies are selected and plasmid 
DNA is isolated, according to the manufacturer's instructions (Stratagene, La 
Jolla, California, USA). 

10 Plasmid DNA, bearing the point mutation in exon 7, as described 

in the present invention, is subject to PCR amplification with primers, bearing 
SfiC and SfiA sequence overhangs, respectively, as described in the published 
patent application WO 01/75127. The PCR fragment, representing the left arm of 
homology is further processed, as described in the aforementioned patent 

15 application. The vector described in WO 01/75127, includes a linear lambda 
vector (lambda-KO-Sfi) that comprises a stuffer fragment, an E. coli origin of 
replication, an antibiotic resistance gene for bacteria selection, two negative 
selection markers suitable for use in mammalian cells, and LoxP sequences for 
cre-recombinase mediated conversion of linear lambda phages into high copy 

20 plasmids. In a final lambda targeting vector, the stuffer fragment is replaced by Sfi 
A,B,C,D ligation of the left arm of homology (bearing the AGR2 point mutation 
in exon 7), an ES cell selection cassette, and a right arm of homology, as 
described in the aforementioned patent application. In-vitro packaging of the 
ligation products, plating of a phage library, plasmid conversion, and DNA 

25 isolation of the homologous recombination plasmid vector is performed according 
to standard procedures, known by persons skilled in the art. 

2. ES cell transformation and mice production. 

Targeting vectors containing the point mutation are used for mouse 
ES cell transformation and to producing chimeric mice by blastocyst injection and 
30 transfer using standard methodology, well known in the art. The chimeras are bred 
to wild type mice to determine gennline transmission. Heterozygotes and 
subsequently homozygotes are generated according to well known techniques. 
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Example 6 Expression of murine AGR2 

To identify the cellular RNA expression pattern of the murine 
AGR2 gene, reverse transcribed polymerase chain reaction (RT-PCR) was 
employed. A tissue cDNA panel of 48 different tissues or developmental stages of 
5 the mouse was used, comprising the following tissues: total brain, cerebrum, 
cerebrum left hemisphere, cerebrum right hemisphere, cerebellum, medulla 
oblongata, medulla spinalis, thyreoidea/trachea, olfactory lobes, lung, tongue, 
esophagus, salivary gland, stomach, pituitary gland, pancreas, small intestine, 
large intestine, eye, appendix, nose epithelium, rectum, trachea, thymus, heart, 
uterus, mesenterium, placenta, gall bladder, sternum, liver, bone marrow, spleen, 
whole blood, kidney, skin, adrenal gland, adipose tissue, bladder, skeletal muscle, 
testis, Es-cells, epididymis, prostate, embryo d 5,5, embryo d 9,5, embryo d 13,5 
head, embryo d 13,5 body, embryo d 18,5 head, embryo d 18,5 body, embryo d 10 
- 12 (Ambion), cDNA pool, plus a negative (water) control: 

The primers used are the following: mAgr2-7 5 X - 
CAGACCCTTGATGGTCATTC; SEQ ID NO:7, mAgr2-2 5 % - 
GTCTCCTGACCCGGTGCGCAG; SEQ ID NO:8. The PCR product of 349 bp in 
length represents a PCR product specific for mouse Agr2, as verified by sequence 
analysis. Expression of mouse AGR2 was identified in the following cells and 
organs: medulla oblongata, eye, nose epithelium, trachea, thyreoidea, lung, 
esophagus, salivary gland, stomach, small intestine, large intestine, appendix, 
rectum, gall bladder, testis, epididymis, uterus, placenta, embryo at day 5.5 and 
embryo at day 13.5, as seen in Figure 6. 

Example 7 Expression of human AGR2 

To identify the cellular RNA expression pattern of the human 
AGR2 gene, reverse transcribed polymerase chain reaction (RT-PCR) was 
employed. A tissue cDNA panel of 29 different tissues from human was used, 
comprising the following tissues: total brain, cerebellum, trachea, lung, 
esophagus, stomach, salivary gland, pancreas, colon, rectum, thymus, heart, 
pericardium, liver, fetal liver, spleen, kidney, adrenal gland, bladder, uterus, 
cervix, placenta, breast, mammary gland, testis, prostate, skin, adipose tissue, 
skeletal muscle. The primers used are the following: hAGR2-l 5*- 
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GAACCTGCAGATACAGCTCTG; (SEQ ID N0:9) hAGR2-4 5*- 
CACACTAGCCAGTCTTCTCAC; (SEQ ID NO:10). The PCR product 170 bp in 
length represents the PCR product specific for human AGR2, as verified by 
sequence analysis. Strong expression of human AGR2 was identified in the 
5 following tissues: trachea, stomach, salivary gland, colon, rectum, kidney, uterus, 
cervix, mammary gland, prostate, as seen in Figure 7. 

The tissue specific expression profile of both genes, mouse AGR2 and human 
AGHR2, is very similar. 

10 Example 8 Tissue-specific Expression of human Agr2 mRNA, analyzed by 
Northern Hybridization, 

Northern hybridization of polyA* RNAs from several human 
tissues was carried out using a human AGR2 specific DNA probe. The probe was 
generated by radiolabeling a purified and sequence-verified PCR product 

15 generated by using primers hAgr2-3 (SEQ ID NO:31) and hAgr2-4 (SEQ ID 
NO:32), comprising the open reading frame of AGR2. The probe is 532 bp in 
length (see SEQ ID NO:33). Commercially available Multiple Tissue Northern 
Blots (4 different MTN blots (MTNl, MTN2, MTN3, MTN4) of BioChain 
Institute, Hayward CA, USA) each containing 3 micrograms of poly A + RNA per 

20 lane; Human Digestive System 12 lane MTN (MTNl 2) blot by Ciontech/Becton 
Dickinson, San Jose, USA, each lane containing 3 micrograms of poly A + RNA) 
were hybridized, following the manufacturer's instructions. These blots are 
optimized to give best resolution in the 1.0-4.0 kb range, and marker RNAs of 9.5, 
7.5, 4.4, 2.4, 1.35 and 0.24 kb were run as reference. Membranes were pre- 

25 hybridized for 30 minutes and hybridized overnight at 68°C in ExpressHyb 
hybridization solution (Clontech Laboratories, Palo Alto CA, USA) as per the 
manufacturer's instructions. The DNA probe used was labeled with [a 32 P] dCTP 
using a random primer labeling kit (Megaprime DNA labeling system; 
Amersham Pharmacia Biotech, Piscataway NJ, USA) and had a specific activity 

30 of 1 x 10 9 dpm/jig. The blots were washed several times in 2x SSC, 0.05% SDS 
for 30-40 minutes at room temperature, and were then washed in O.lx SSC, 0.1% 
SDS for 40 minutes at 50°C (see Sambrook et al y 1989, "Molecular Cloning, A 
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Laboratory ManuaV\ Cold Spring Harbor Press, New York, USA). The blots 
were covered with standard domestic plastic wrap and exposed to X-ray film at - 
70°C with two intensifying screens for 18 hours. 

The tissues represented in the Clontech/Becton-Dickinson and in 
5 the BioChain Institute Multiple Tissue Northern Blots are as follows: 
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20 

The results of this experiment indicate that human AGR2 mRNA is 
strongly expressed in stomach, duodenum, ileocecum, ileum, descending colon, 
transverse colon, caecum, and rectum. Weaker expression is detected in lung, 
cervix, and prostate (see Figure 8). The usage of two different polyadenylation 
25 signals leads to AGR2 transcripts of 950 nucleotides and of 1800 nucleotides in 
lengths. 

Example 9 Characteristics of human and mouse AGR2 Protein and tissue 
specific Expression. 

The human orthologue of the mouse Agr2 protein, human AGR2 
30 protein, has a length of 175 amino acid residues (in comparison to 175 amino acid 
residues for the corresponding mouse protein). Figure 2 represents an amino acid 
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alignment of mouse Agr2 and human AGR2, indicating an amino acid identity of 
9 1 %, indicating that these are orthologues. 

Murine Agr2 protein was detected in goblet cells, using an anti- 
murine Agr2 antiserum, as described in Example 11, and as shown in Figure 10. 
5 Goblet cell specificity was confirmed with an anti-TFF3 antibody (kindly 
provided by W. Hoffmann, Universitatskiinikum Magdeburg, Germany). In situ- 
hybridization confirmed Agr2 protein expression in Brunner's glands (data not 
shown). 

Example 10 Cloning of mouse and human AGR2 into Expression Vectors. 

To express wild type or mutant AGR2 in bacteria or eukaryotic 
cells, the cDNA can be cloned into a expression vector using standard cloning and 
transfection techniques, as described, for instance, in Sambrook et al. (eds.), 
Molecular Cloning: A Laboratory Manual (2 nd Ed.), Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, 1989; and Ausubel et al. (eds.), 
Current Protocols in Molecular Biology, John Wiley & Sons, New York, 
NY, 1993. A preferred method is the cDNA subcloning into expression vectors of 
the Gateway cloning and expression system (Invitrogen, California, USA), 
according to the manufacturer's instructions. 

Purification of recombinant AGR2 from host cells can be 
performed using standard methods well-known to those skilled in the art. For 
standard references, see above. 

Example 11 Method for the Production of Antibodies specific for AGR2 
Epitopes. 

The production of antibodies specific for AGR2 was performed 
according to well known techniques, as described for example herein or in Paul 
Suhir, Antibody engineering Protocols, Humana Press, 1995 and William C. 
Davis (ed), Monoclonal antibody production, Humana Press 1995. 

1. Preparation of antigens 

To obtain antigen for the immunization of animals, recombinant 
AGR2 proteins or fragments thereof may be expressed in pro — or eukaryotic cells 
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and purified from the cell lysates according to standard techniques as described 
for example in Joseph Sambrook et al., Molecular Cloning: A Laboratory Manual, 
Cold Spring Harbor Laboratory Press; 3 rd ed. 2001), and as described in Example 
10. Alternatively, specific peptides with approximately up to ~ 60, preferably 15 
5 to 25 residues with a sequence identical to parts of AGR2, were synthesized and 
coupled to keyhole limpet hemocyanin (KLH) or bovine serum albumin (BSA) 
via an additional cysteine at the C- or N-terminus as described in Schn5lzer et al. 
(1992). Peptides for immunizations can be derived from any part of the amino 
acid sequence of AGR2, preferably from regions with high probability for 
10 localization on the surface of the protein (as predicted for example with the 
sequence analysis tools of The European Molecular Biology Open Software Suite) 
and with low sequence homology to other known proteins, preferably the peptide 
TVKSGAKKDPKDSRPKLPQ (SEQ ID NO:34) 

2. Immunization 

15 For the production of antibodies in animals, the synthetic peptides 

coupled to a carrier protein or the purified recombinant protein were injected 
subcutaneously into an animal. For a mouse or rabbit, 100 to 200 ng of antigen 
were used. Antigen were dissolved in a suitable adjuvant, preferably Complete 
Freund's Adjuvant (Sigma, St. Louis, MO, USA) for the initial injection, and 
20 Freund's Incomplete Adjuvant (Sigma) for all subsequent injections, to a final 
volume of about 200^1 per animal. 

Booster injections were given after several weeks, perferably 5, 9 
and 13 weeks after the first injection. Shortly after the fourth injection, preferably 
after ten days, the animals were anesthesized and killed by heart punctation. Sera 
we re separated. 

Example 12 Western Blot Analysis. AGR2 is a secreted Protein released 
from cultured Colon Cancer Cells. 

Western blot analysis was performed as described in Ausubel et al. 
(eds.), Current Protocols in Molecular Biology, John Wiley & Sons, New 
York, NY, 1993. AGR2 protein was detected using the anti-murine AGR2 
antiserum, as described in Example 1 1. The human colon cancer cell lines Caco-2 
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(ATCC No. HTB-37), HT-29 (ATTC No. HTB-38), and LS 174T (ATTC No. 
CL-188) endogenously express human AGR2 protein. In contrast the simian 
fibroblastoid cell line COS-7 (ATTC No. CRL-1651) does not express detectable 
amounts of AGR2 protein. (See Figure 20, IP (immunoprecipitated) cell pellet.) In 
5 the present example, lxlO 7 cells were lysed in 1ml detergent lysis buffer 
containing 1% NP-40, 25mM Tris pH 7.5, 150mM NaCl and 5mM EDTA. 
Protein concentrations were determined and amounts of lysate corresponding to 
30jig of total protein were resolved by SDS-PAGE. After blotting on 
nitrocellulose membranes, AGR2 protein was detected using an AGR2 specific 
10 rabbit antiserum (1:1000 fold dilution in TBST) and a secondary, peroxidase- 
coupled anti-rabbit IgG reagent. Visualization was achieved by 
chemiluminescence. 

AGR2 is a secreted protein, since AGR2 protein is detected in 
supernatants conditioned from HT-29 and LS174T, respectively, after supernatant 
15 concentration and immunoprecipitation using the before-mentioned anti-murine 
AGR2 antiserum, as shown in Figure 20. Supernatants have been conditioned for 
1 day and 3 days, respectively (IP Id conditioned supernatant, IP 3d conditioned 
supernatant). AGR2 protein is also detectable in the lysates cell pellet. In the 
present example, 20 ^il of a Mon-1 specific rabbit antiserum were added to 10 ml 
of culture supernatants conditioned by lxlO 7 cells for 24 and 72 hours, 
respectively. Following incubation, immunocomplexes containing Mon-1 protein 
were collected by adding immobilized protein A and resolved by SDS-PAGE, 
Immunoprecipitated Mon-1 protein was detected as described above. 

Example 13 Gene Therapy 

A number of viruses, including retroviruses, adenoviruses, herpes 
viruses, and pox viruses, have been developed as live viral vectors for gene 
therapy. A nucleic acid that encodes for mutated AGR2 protein (SEQ ID NO:30) 
or wild type AGR2 protein (SEQ ID NO:4) is inserted into the genome of a parent 
virus to allow them to be expressed by that virus. This is accomplished by first 
constructing a DNA donor vector for in vivo recombination with a parent virus. 

The DNA donor vector contains (i) a prokaryotic origin of 
replication, so that the vector may be amplified in a prokaryotic host; (ii) a gene 
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encoding a marker which allows selection of prokaryotic host cells that contain 
the vector (e.g., a gene encoding antibiotic resistance); (iii) at least one gene 
encoding a desired protein located adjacent to a transcriptional promoter capable 
of directing the expression of the gene; and (iv) DNA sequences homologous to 
5 the region of the parent virus genome where the foreign gene(s) will be inserted, 
flanking the construct of element (iii). 

The donor vector further contain additional genes which encodes 
one or more marker which will allow identification of "recombinant viruses 
containing inserted foreign DNA. The marker genes to be used include genes that 
10 encode antibiotic or chemical resistance (e.g., see Spyropoulos et al., J. Virol., 
62:1046 (1988); Falkner and Moss., J. Virol., 62:1849 (1988); Franke et al., Mol. 
Cell. Biol., 5:1918 (1985), as well as genes such as the E. coli lacZ gene, that 
permit identification of recombinant viral plaques by calorimetric assay (Panicali 
et al., Gene, 47:193-199 (1986)). 
15 Homologous recombination between donor plasmid DNA and viral 

DNA in an infected cell are made using standard techniques. The recombination 
results in the formation of recombinant viruses that incorporate the nucleic acid 
encoding SEQ ID NO:29 for human mutated AGR2 or SEQ ID NO:5 for human 
wild type AGR2. Appropriate host cells for in vivo recombination are eukaryotic 
20 cells that can be infected by the virus and transfected by the plasmid vector such 
as chick embryo fibroblasts, HuTK143 (human) cells, and CV-1 and BSC-40 
(both monkey kidney) cells. Infection of cells by the virus and transfection of 
these cells with plasmid vectors is accomplished by techniques standard in the art. 

Following in vivo recombination, recombinant viral progeny are 
identified by co-integration of a gene encoding a marker or indicator gene with the 
foreign gene(s) of interest, which, in this case, is the P-galactosidase gene. The 
presence of the P-galactosidase gene is selected using the chromogenic substrate 
5-bromo-4-chloro-3-indolyl-p-D-galactosidase (Panicali et al., Gene, 47:193 
(1986)). Recombinant virus appears as blue plaques in the host cell. Expression of 
the polypeptide encoded by the inserted gene is further confirmed by in situ 
enzyme immunoassay performed on viral plaques and confirmed by Western blot 
analysis, radioimmunoprecipitation (RIPA), and enzyme immunoassay (EIA). 
Positive viruses are cultured and expanded and stored. 
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Example 14 siRNA Generation and Use in Therapy 

Production of RNAs 

Sense RNA (ssRNA) and antisense RNA (asRNA) of AGR2 are 
produced using known methods such as transcription in RNA expression vectors. 

5 In the initial experiments, the sense and antisense RNA are about 500 bases in 
length each. The produced ssRNA and asRNA (0.5 nM) in 10 mM Tris-HCl (pH 
7.5) with 20 mM NaCl were heated to 95°C for 1 min, then cooled and annealed 
at room temperature for 12 to 16 h. The RNAs were precipitated and resuspended 
in lysis buffer (below). To monitor annealing, RNAs were electrophoresed in a 

10 2% agarose gel in TBE buffer and stained with ethidium bromide (Sambrook et 
al., Molecular Cloning. Cold Spring Harbor Laboratory Press, Plainview, N.Y. 
(1989)). 

Lvsate Preparation 

Untreated rabbit reticulocyte lysate (Ambion) are assembled 

15 according to the manufacturers directions. dsRNA was incubated in the lysate at 
30°C for 10 min prior to the addition of mRNAs. Then AGR2 mRNAs are added 
and the incubation continued for an additional 60 min. The molar ratio of double 
stranded RNA and mRNA is about 200:1. The AGR2 mRNA is radiolabeled 
(using known techniques) and its stability is monitored by gel electrophoresis. 

20 In a parallel experiment made with the same conditions, the double 

stranded RNA is internally radiolabeled with a- 32 P-ATP. Reactions are stopped 
by the addition of 2x proteinase K buffer and deproteinized as described 
previously (Tuschl et al., Genes Dev., 13:3191-3197 (1999)). Products are 
analyzed by electrophoresis in 15% or 18% polyacrylamide sequencing gels using 

25 appropriate RNA standards. By monitoring the gels for radioactivity, the natural 
production of 10 to 25 nt RNAs from the double stranded RNA can be 
determined. 

The band of double stranded RNA, about 21-23 bps, is eluted. The 
efficacy of these 21-23 mers for suppressing AGR2 transcription may be assayed 
30 in vitro using the same rabbit reticulocyte assay described above using 50 
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nanomolar of double stranded 21-23 mer for each assay. The sequence of these 
21-23mers is then determined using standard nucleic acid sequencing techniques. 

RNA Preparation 

21 nt RNAs, based on the sequence determined above, were 

5 chemically synthesized using Expedite RNA phosphoramidites and thymidine 
phosphoramidite (Proligo, Germany). Synthetic oligonucleotides were deprotected 
and gel-purified (Elbashir, S. M, Lendeckel, W. & Tuschl, T., Genes & Dev. 15, 
188-200 (2001)), followed by Sep-Pak CI 8 cartridge (Waters, Milford, Mass., 
USA) purification (Tuschl, T., et al., Biochemistry, 32:1 1658-1 1668 (1993)). 

10 These RNAs (20 \xM) single strands are incubated in annealing 

buffer (100 mM potassium acetate, 30 mM HEPES-KOH at pH 7.4, 2 mM 
magnesium acetate) for 1 min at 90°C. followed by 1 h at 37°C. 

Cell Culture 

Cell cultures that regularly express AGR2, including, but not 

15 limited to FDC-P1, J774A.1 and WEHI-231 cells, are propagated using standard 
conditions. 24 hours before transfection, at approx. 80% confluency, the cells are 
trypsinized and diluted 1:5 with fresh medium without antibiotics (l-3xl0 5 
cells/ml) and transferred to 24-well plates (500 |xl/well). Transfection is performed 
using a commercially available lypofection kit and AGR2 expression is monitored 

20 using standard techniques with positive and negative control. Positive control is 
cells that naturally express AGR2 while negative control is cells that do not 
express AGR2. It is seen that base-paired 21 and 22 nt siRNAs with overhanging 
y ends mediate efficient sequence-specific mRNA degradation in lysates and in 
cell culture. Different concentrations of siRNAs are used. An efficient 

25 concentration for suppression in vitro in mammalian culture is between 25 nM to 
100 nM final concentration. This indicates that siRNAs are effective at 
concentrations that are several orders of magnitude below the concentrations 
applied in conventional antisense or ribozyme gene targeting experiments. 

The above method provides a way both for the deduction of AGR2 

30 siRNA sequence and the use of such siRNA for in vitro suppression. In vivo 
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suppression may be performed using the same siRNA using well known in vivo 
transfection or gene therapy transfection techniques. 

This invention has been described in detail including the preferred 
embodiments thereof. However, it will be appreciated that those skilled in the art, 
5 upon consideration of this disclosure, may make modifications and improvements 
thereon without departing from the spirit and scope of the invention as set forth in 
the claims. All references, patents, patent applications and Genbank references 
recited in this patent application are hereby incorporated by reference in their 
entirety. 

10 Example 15 Method for the Production of transgenic non-human Animals 
carrying a Transgene of Agr2, produced by Gene Targeting 
Technology 

Transgenic mice carrying a mammalian Agr2 transgene are 
generated by either using the embryonic stem cell method, or the pronucleus 
15 method, both of them well-known methods in the art; preferably using the method 
of Nehls and Wattler, as described in WO 01/75127. For transgenic methods see 
also US patents US 6,436,701, US 6,018,097, US 5,942,435, US 5,824,837, US 
5,731,489, and US 5,523,226. 

Example 16 Agr2 Signal Peptide Prediction 

20 The publicly available program "SignalP VI .1" was used to predict 

the probabilities of N-terminal signal peptides in murine and human Agr2 
(Nielsen et al., 1997). The C-score (raw cleavage site score) of "SignalP Via" 
represents the output score from networks trained to recognize cleavage sites vs. 
other sequence positions. It was trained to be high at position +1 (immediately 

25 after the cleavage site) and low at all other positions. The S-score (signal peptide 
score) of "SignalP VI. 1" represents the output score from networks trained to 
recognize signal peptide vs. non-signal-peptide positions. It was trained to be high 
at all positions before the cleavage site and low at 30 positions after the cleavage 
site and in the N-terminals of non-secretory proteins. The Y-score (combined 

30 cleavage site score) of "SignalP VI. 1" represents the prediction of cleavage site 
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location is optimized by observing where the C-score is high and the S-score 
changes from a high to a low value. The Y-score formalizes this by combining the 
height of the C-score with the slope of the S-score. Specifically, the Y-score is a 
geometric average between the C-score and a smoothed derivative of the S-score 
5 (i.e., the difference between the mean S-score over d positions before and d 
positions after the current position, where d varies with the chosen network 
ensemble). All three scores are averages of five networks trained on different 
partitions of the data. 

For mouse Agr2 the program predicts with a high probability an N- 
10 terminal signal sequence encoded by the amino acids 1 to 20, and a cleavage site 
between amino acid 20 and 21 (see Figure 15 A). 

For human AGR2 the program predicts with a high probability an 
N-terminal signal sequence encoded by the amino acids 1 to 20, and a cleavage 
site between amino acid 20 and 21 (see Figure 15B). 

15 

Example 17 Amino Acid Comparison between mouse and human AGR2 

The open reading frame of the mouse and human AGR2 cDNAs 
described herein encode deduced proteins of each 175 amino acids in size. 
Structural analysis of the sequence reveals a high probability for a translocation 

20 signal peptide which is removed after passing through the membrane. In both 
peptides, the most probable cleavage point is between amino acid 20 and 21 (LA- 
RD in human; LA-KD in mouse), creating a mature protein of 155 aa each. Signal 
peptide prediction was performed as described in Example 16 and as shown in 
Figures 15A and 15B, using the website of Center for Biological Sequence 

25 Analysis, BioCentrum-DTU, Technical University of Denmark, www.cbs.dtu.dk\ 
The degree of amino acid identity between mouse and human Agr2 peptide is 
91%, whereas the degree of similarity reaches 95%. 

Example 18 Characterization of Agr2 Proteins from different Species - 
Amino Acid Conservation 
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1 . In an inter-species comparison of mouse, rat, and human Agr2 
peptide amino acids, the overall degree of identity is almost 91%, whereas the 
degree of similarity reaches 95%. The high degree of amino acid identity and 
similarity is indicative for highly conserved residues between the species (see 

5 Figure 16 and Table 1), indicating functional significance of these conserved 
residues in the peptides compared in this Example. The amino acid that is 
exchanged in the MTZ phenotype, 137V, is identical between the species 
compared. 

2. In an inter-species comparison of mouse, rat, human and 
10 Xenopus laevis Agr2 peptide amino acids, the overall degree of identity is 67%, 

whereas the degree of similarity reaches 82%. The high degree of amino acid 
identity and similarity is indicative for highly conserved residues between the 
species (see Figure 17 and Table 2), indicating functional significance of these 
conserved residues in the peptides compared in this Example. Again, the amino 
15 acid that is exchanged in the MTZ phenotype, 137V, is identical between the 
species compared. 

3. In an inter-species comparison of mouse, rat, human, Xenopus 
laevis, and C elegans Agr2 peptide amino acids, the overall degree of identity is 
32%, whereas the degree of similarity reaches 46%. The degree of amino acid 

20 identity and similarity is indicative for highly conserved residues between the 
species (see Figure 18 and Table 3), indicating functional significance of these 
conserved residues in the peptides compared. The amino acid exchanged in the 
MTZ phenotype, 137V, is identical between the species compared in this 
Example, except for C. elegans. The C. elegans AGR2 protein is bearing a 

25 similar, i.e., nonpolar and hydrophobic, amino acid at the corresponding residue 
position 1 37 (L instead of V). 

Evolutionary pressure has conserved these residues at their 
particular locations in the molecule. It is predicted that any non-conservative aa 
substitution will modify the peptide's normal biological function in a manner 

30 analogous to that observed in the present invention. Hence, identification of such 
an abnormal Agr2 peptide sequence in a biological sample, or of the a cDNA 
encoding such an abnormal Agr2 peptide, will be indicative of an increased 
probability of developing the phenotype of the present invention. 
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Example 19 Xenopus Laevis Cement Gland Differentiation Assay 

A functional analysis of mouse Agr2 protein and orthologue AGR2 
peptides can be performed in an assay described by Aberger et al. ((Aberger et al., 
1998)). The authors demonstrated that overexpression of XAG-2, a secreted 

5 protein which acts specifically at cement glands induces both, ectopic cement 
gland differentiation and expression of anterior neural marker genes in Xenopus 
embryos. XAG-2 is a secreted protein homologue to AGR2. 

The assay can be used as a test for particular genes function in the 
specification of the cement gland during embryonic development. The cement 

10 gland is a mucin secreting organ in Xenopus laevis embryos, being functionally 
similar to goblet cells. 

A PCR fragment carrying a full-length Agr2 cDNA sequence, is 
subcloned into a plasmid vector, i.e. pCR 2A-TOPO (K4500-01, Invitrogen, 
Carlsbad, California, USA), according to the manufacturer's instructions. The 

15 plasmid DNA, bearing the correct Agr2 insert is subject to site-directed 
mutagenesis, using a QuickChange Site-Directed Mutagenesis Kit (200518, 
Stratagene, La Jolla, California, USA), as described in Example 5. 

Altering a particular codon sequence (which encodes a particular 
amino acid) by substitution of one, or two, or three base paires of the codon, will 

20 c give rise to AGR2 proteins bearing non-conservative amino acid exchanges at the 
residue positions indicated in Tables 1, 2, and 3, respectively. 

Capped mRNA is synthesized with an SP6 mMessage mMachine 
Kit (Ambion). A small sample of mRNA is in vitro translated with a reticulocyte 
lysate system (Promega) to analyze the quality of RNAs; or with a different 

25 method as described, for instance, in Sambrook et al. (eds.), Molecular 
Cloning: A Laboratory Manual (2 nd Ed.), Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY, 1989; and Ausubel et al. (eds.), Current 
Protocols in Molecular Biology, John Wiley & Sons, New York, NY, 1993. 
Purified mRNA is injected into early cleavage stage embryos of Xenopus laevis, 

30 as described in Aberger et al., 1 998. 

Depending on the point mutations and on the subsequent non- 
conserved amino acid substitutions introduced (at the residue positions listed in 

the Tables 1, 2, and 3, respectively), AGR2 function is analyzed in respect to 
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specification of mucin secreting cement glands. Morphological and histological 
examinations are performed to analyze for cement gland enlargement or additional 
ectopic cement glands, as described in Aberger et al. 

Example 20 Agr2 Function in Cell Proliferation - DNA labeling in a 
5 Growth Factor Assay 

To measure AGR2 activity in cell proliferation, a DNA labeling 
assay can be used. For mammalian AGR2, colon cancer cell lines like LS174T or 
HT29, can be used. LS174T cells exhibit a goblet cell-like phenotype producing 
significant amounts of secretory mucin, as described by Iwakira and Podolsky 
10 (Am. J. Physiol Gastrointest Liver Physiol 280: Gl 114-G1 123, 2001). HT29 cells 
can differentiate into cells with phenotypical characteristics of enterocytes and 
mucin-secreting goblet cells. Any other cells, which are responsive to AGR2 can 
be used. 

AGR2 expression vectors, bearing wt and mutated cDNA 

15 sequences of an mammalian Agr2 gene, and additional control vectors are 
constructed as described in Example 10. A preferred method is the cDNA 
subcloning into expression vectors of the Gateway cloning and expression system 
(Invitrogen, California, USA), according to the manufacturer's instructions. 

There are several protocols to perform cell proliferation assays that 

20 are well known in the art. Typically, the incorporation of a nucleoside analog into 
newly synthesized DNA is employed to measure proliferation (active cell growth) 
in a population of cells. For example, Bromodeoxyuridine (BrdU) can be 
employed as a DNA labeling reagent and Anti-BrdU mouse monoclonal antibody 
can be employed as a detection reagent. This antibody binds only to cells 

25 containing DNA which has incorporated BrdU. A number of detection methods 
can be used in conjunction with this assay including immunofluorescence, 
immunohistochemical, ELISA and colorimetric methods. Kits that include BrdU 
and anti-BrdU mouse monoclonal antibody are commercially available from F. 
Hoffmann-La Roche Ltd (Basel, Switzerland). The assay is performed as 

30 indicated in the manufacturer's protocol. 
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Example 21 Agr2 Function in Goblet Cell Differentiation - Analysis of 
Goblet Cell specific Markers in a quantitative PCR Assay 

To measure AGR2 activity in goblet cell differentiation, e.g., in 
either early or terminal goblet cell differentiation, a cell culture based assay can be 
5 used. For mammalian AGR2, colon cancer cell lines like LS 1 74T or HT29, can be 
used. LS174T cells exhibit a goblet celUike phenotype producing significant 
amounts of secretory mucin, as described by Iwakira and Podolsky (Am. J. 
Physiol Gastrointest Liver Physiol 280: G1114-G1123, 2001). HT29 cells can 
differentiate into cells with phenotypical characteristics of enterocytes and mucin- 
10 secreting goblet cells. 

AGR2 expression vectors, bearing wt and mutated cDNA 
sequences of an mammalian Agr2 gene, and additional control vectors are 
constructed as described in Example 10. A preferred method is the cDNA 
subcloning into expression vectors of the Gateway cloning and expression system 
15 (Invitrogen, California, USA), according to the manufacturer's instructions. 

Cells are transfected with expression vectors as described above. 
Transfection of culture cells with expression vectors is well known in the art and 
described, for instance, in Sambrook et al. (eds.), Molecular Cloning: A 
Laboratory Manual (2 nd Ed.), Cold Spring Harbor Laboratory Press, Cold 
20 Spring Harbor, NY, 1989; and Ausubel et al. (eds.), Current Protocols in 
Molecular Biology, John Wiley & Sons, New York, NY, 1993. 

A major mucin subtype secreted by intestinal goblet cells is mucin2 
(muc2). Mucin2 serves, like mucin subtype TFF3, as a marker for terminal 
differentiation. Human muc2 primers are designed to PCR amplify an about 200 
25 bp DNA fragment at cDNA, which is freshly synthezised at mRNA of transfected 
and non-transfected controle cells. The quantitative PCR analysis (Light cycler; 
Roche, Basel, Switzerland) is performed, according to the manufacturer's 
instruction. 

AGR2 function in goblet cell differentiation is analyzed by 
30 quantitative determination of human muc2 PCR products. The amount of specific 
PCR product is depending on the paticular type of AGR2 expression vector (wild 
type cDNA, mutated cDNA, position of mutation) used for transfection. The 
analysis is not limited to muc2. 
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Example 22 AGR2 Mutations resulting in abnormal AGR2 Protein 
Expression Levels 

It is predicted that any mutation in the AGR2 gene resulting in 
abnormal AGR2 peptide expression levels in an individual will interfere with the 
peptide's normal biological function, including in a manner analogous to that 
observed in the present invention. Mutations leading to abnormal AGR2 peptide 
expression levels might affect any aspect of gene expression, e.g. DNA 
transcription, mRNA transport and processing, mRNA translation or AGR2 
peptide half-life itself. 

For instance, identification of an abnormal AGR2 peptide level in a 
biological sample will be indicative of an increased probability of developing the 
phenotype of the present invention. Methods for quantifying the peptide 
expression levels in a biological sample are well known in the art. AGR2 peptide 
levels could be analysed by obtaining a biopsy from an individual and quantifying 
the amount of AGR2 peptide by the use of an antibody or any other probe 
specifically recognizing the AGR2 peptide, e.g. using an ELISA or a Western 
Blot. 

Alternatively, identification of an abnormal AGR2 mRNA level in 
a biological sample will be indicative of an increased probability of developing 
the phenotype of the present invention. Methods for quantifying the mRNA 
expression levels in a biological sample are well known in the art. AGR2 mRNA 
levels could be analysed by obtaining a biopsy from an individual and quantifying 
the amount of AGR2 mRNA by the use of quantitative RT-PCR or any other 
method relying on probes specifically recognizing the AGR2 mRNA. 

Alternatively, identification of an abnormal AGR2 mRNA 
transport and processing in a biological sample will be indicative of an increased 
probability of developing the phenotype of the present invention. AGR2 mRNA 
processing could be analysed by obtaining a biopsy from an individual and 
quantifying the processing of AGR2 mRNA by the use of Northern blotting or 
qualitative RT-PCR or any other method relying on probes specifically 
recognizing the AGR2 mRNA processing. 
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Moreover, any given mutation in the AGR2 gene could be tested 
for its effect on AGR2 expression by using an appropriate artificial expression 
system. 

For instance, a cDNA encoding any given mutated AGR2 peptide 
5 could be isolated and expressed in any suitable expression system. The amount of 
expressed AGR2 peptide or mRNA or the AGR2 mRNA transport and processing 
could be analysed by using methods analogous to those mentioned above. 

Alternatively, regulatory sequences of the AGR2 gene could be 
isolated and analysed in any suitable expression system. Expression levels of an 
10 appropriate reporter gene would be indicative for the efficiency of the AGR2 
regulatory sequences to direct gene expression. 

Once mutations in the AGR2 gene resulting in abnormal AGR2 
peptide expression levels in an individual or in a suitable expression system are 
identified, this knowledge might be used to screen any suitable biological sample 
15 for presence of such a mutation by means well known in the art, including 
sequencing of the individual's AGR2 cDNA or genomic DNA. Individuals 
carrying any of the previously characterized mutations will bare an increased risk 
of developing the phenotype of the present invention. 

Example 23 Statistical Analysis of Populations to identify Correlations 
20 between AGR2 Haplotype and Disease Risk 

In order to identify mutants of the human AGR2 gene, which are 
indicative of an increased probability of developing the phenotype described by 
the present invention, the AGR2 haplotypes are determined from defined 
collectives of patients displaying a disease phenotype reminiscent to that 

25 described in the present invention in comparison to a suitable healthy control 
population. AGR2 alleles, which are significantly over-represented in the affected 
population versus the control population are correlated with the disease risk, see in 
Griffiths, Anthony J.F.; Gelbart, William M; Miller, Jeffrey H.; Lewontin, 
Richard C. Modern Genetic Analysis. New York: W H Freeman & Co; cl999. 

30 Therefore, individuals carrying any of these over-represented 

AGR2 alleles will bare an increased risk of developing the phenotype of the 
present invention. 
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Example 24 Detection of transcriptionally deregulated Genes expressed in 
the Colon. 

A series of genes selected for their putative biological relevance to 
goblet cell function were analysed for altered RNA expression levels in the colon 

5 of newborn MTZ mice, in comparison to expression levels in colon of wild type 
mice. Significantly reduced expression levels were found for Mucin2 (Muc2) and 
Trefoil factor 3 (TFF3), as shown in Figure 19. Both genes encode the major 
protein components of mucin and both proteins, Muc-2 and TFF3, serve as marker 
for late goblet cell differentiation. Reduced transcriptional activity of these 

10 differentiation marker genes is indicative of an incomplete maturation process of 
the goblet cells. Transcriptional deregulation was determined by quantitative 
PCR-Light Cycler technology (Roche Diagnostics GmbH, Mannheim, Germany), 
according to the manufacturer's instructions. 
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Table 1 . Conserved amino acid residues in mouse, rat, and human 
a) identical residues 



M1 


E2 


K3 


V6 


S7 A8 


L10 


L11 L12 


V13 A14 


S16 


T18 


L19 A20 


D22 


T23 


T24 


V25 


K26 G28 


K30 


K31 D32 


K34 -D35 — S36— R37 — P38— K39 


L40 


P41 


Q42 


T43 


L44 S45 


R46 


G47 W48 


G49 D50 


Q51 


L52 


153 W54 


T55 


Q56 


T57 


Y58 


E59 E60 


A61 


L62 Y63 


S65 K66 


T67 


S68 


N69 P71 


L72 


M73 


I75 


H76 


H77 L78 


D79 


E80 C81 


P82 H83 


S84 


Q85 


A86 L87 


K88 


K89 


V90 


F91 


A92 E93 


K95 


E96 I97 


0.98 K99 


L100 


A101 


E102 Q103 


F104 


V105 


L106 


L107 


N108 L109 


Y111 


E112 T113 


T114 D115 


K116 


H117 


L118 S119 


P120 


D121 


G122 


Q123 Y124 V125 P126 


R127 1128 


F130 V131 


D132 


P133 


S134 L135 


T136 


V137 


R138 


A139 


D140 1141 


T142 


G143 R144 Y145 S146 


N147 


R148 


L149 Y150 


A151 


Y152 


E153 


P154 


D156 T157 A158 


L159 L160 


D162 N163 


M164 K165 


K166 A167 


L168 


K169 


L170 


L171KT173 E174 L175 













b) similar residues 



lorL15 KorR21 AorS29 KorR64 RorK70 Vorl73 V or 1110 
VorM129 SorA155 



Explanation of amino acid single letter code: 

A=Ala R=Arg N=Asn D=Asp OCys E=Glu Q=Gln G=Gly 

H=His Nile L=Leu K=Lys M=Met F=Phe P=Pro S=Ser 
T=Thr W=Trp Y=Tyr V=Val 
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Table 2. Conserved amino acid residues in mouse, rat, human, 
and Xenopus. 

a) identical residues in respect to mouse, rat, and human 
amino acid positions. 



M1 E2 S7 L11 L12 V13 A14 S16 T18 L19 A20 P41 Q42 T43 L44 

S45 R46 G47 W48 G49 D50 L52 W54 Q56 T57 Y58 E59 E60 L62 K66 

N69 P71 L72 I75 H77 C81 P82 H83 S84 Q85 A86 L87 K88 K89 F91 

A92 E93 197 Q98 K99 L100 A101 E102 F104 L106 U07 N108 L109 Y111 T114 

D115 K116 L118 D121 G122 Q123 Y124 V125 P126 F130 V131 D132 P133 S134 L135 

V137 R138 A139 D140 G143 Y145 S146 N147 Y150 Y152 E153 P154 D156 L160 N163 

M164 K165 K166 A167 L168 L170 L171KT173 E174 L175 



b) similar residues in respect to mouse, rat, and human 
amino acid positions. 

lorL15 KorR20 DorE21 A or S29 KorR39 QorN51 AorG61 YorF63 

KorR64 SorA65 TorS67 RorK70 MorL73 VorlorL74 DorN79 . EorD80 

QorE103 V or 1105 Lor 1109 V or 1110 PorK127 lorV128 VorM129 lorL141 

RorK144 RorH148 DorE161 



Explanation of amino acid single letter code: 
A=Ala R=Arg N=Asn D=Asp 
H=His l=lle L=Leu K=Lys 

T=Thr W=Trp Y=Tyr V=Val 



OCys E=Glu Q=Gln G=Gly 
M=Met F=Phe P=Pro S=Ser 
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Table 3. Conserved amino acid residues in mouse, rat, human, 
Xenopus, and C. elegans. 

a) identical residues in respect to mouse, rat, and human 
amino acid positions. 

S7 L12 L44 R46 G47 G49 D50 W54 E59 P71 H77 C81 A86 187 K88 
K89 F91 K99 L100 E102 F104 N108 D121 G122 Y124 F130 D132 Y150 Y152 D132 
M164 K165 L168 



b) similar residues in respect to mouse, rat, and human 
amino acid positions. 



I or L1 5 K or R20 D or E21 A or S29 K or R39 Q or N51 A or G61 Y or F63 

KorR64 SorA65 TorS67 RorK70 MorL73 VorlorL74 DorN79 EorD80 

QorE103 V or 1105 Lor 1109 V or 1110 PorK127 lorV128 VorM129 IorL141 

RorK144 RorH148 DorE161 VorL13 SorA16 QorN42 SorA45 WorF48 

Lor!52 YorW58 EorD60 Lorl62 NorD69 Lorl72 lorL75 EorQ93 

AorS101 LorM106 LorV107 DorE115 Vorl125 VorL131 VorL137 SorA146 

Lor 1160 EorD174 



Explanation of amino acid single letter code: 

A=Ala R=Arg N=Asn D=Asp C=Cys E=Glu Q=Gln G=Gly 

H=His l=I!e L=Leu K=Lys M=Met F=Phe P=Pro S=Ser 
T=Thr W=Trp Y=Tyr V=Val 
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SUMMARY OF SEQUENCES 



SEQ ID NO:l: Agr2 mouse nuc-seq Mutant C3H 
5 SEQ ID NO:2:Agr2 mouse prot-seq Mutant 
SEQ ID NO:3: Agr2 mouse prot-seq WT 
SEQ ID NO:4: AGR2 human prot-seq WT 
SEQ ID NO:5: AGR2 human nuc-seq WT 
SEQ ID NO:6: Agr2 mouse nuc-seq WT 

10 SEQ ID NO:7: mAgr2-7 primer 
SEQ ID NO:8: mAgr2-2 primer 
SEQ ID NO:9: hAgr-1 primer 
SEQ ID NO: 10: hAgr-4 primer 
SEQ ID NO:ll: Idb2-SNP-marker 

15 SEQ ID NO: 12: primerl Idb2-SNP-marker 
SEQ ID NO: 13: primer2 Idb2-SNP-marker 
SEQ ID NO:14: D12Mit64 MIT-marker 
SEQ ID NO:15: primerl D12MU64 MIT -marker 
SEQ ID NO:16: primer2 D12Mit64 MIT -marker 

20 SEQ ID NO:17-28: agr2 primers 1-12 

SEQ ID NO:29: AGR2 human nuc-seq Mutant 
SEQ ID NO:30: AGR2 human prot-seq Mutant 
SEQ ID NO:31 hAgr2-3 primer 
SEQ ID NO:32: hAgr2-4 primer 

25 SEQ ID NO.-33: PCR product of hAgr2-3 and hAgr2-4 



SEQ ID NO:3 listed below (i.e., the wild type mouse Agr2 protein sequence) 
corresponds to the sequence to be found in Genbank under accession number 
30 NP 035913. 

SEQ ID NO:4 listed below (i.e., the wild type human AGR2 protein sequence) 
corresponds to the sequence to be found in Genbank under accession number 
NP 006399. 

35 



SEQ ID NO:l nucleic acid sequence (cDNA) of mutant Agr2 (mus 
musculus; C3H) 

40 

GGCAACCCTTGCGGCTCACACAAAGCAGGAGGGTGGGAAGCCCAGATTTGCCATGGAGAAATTTTC 
AGTGTCTGCAATCCTGCTTCTTGTGGCCATTTCTGGTACCTTGGCCAAAGACACCACAGTCAAATC 
TGGAGCCAAAAAGGACCCAAAGGACTCTCGGCCCAAACTACCTCAGACACTCTCCAGAGGTTGGGG 
CGATCAGCTCATCTGGACTCAGACATACGAAGAAGCTTTATACAGATCCAAGACAAGCAACAGACC 
45 CTTGATGGTCATTCATCACTTGGACGAATGCCCACACAGTCAAGCCTTAAAGAAAGTGTTTGCTGA 
ACATAAAGAAATCCAGAAATTGGCAGAGCAGTTTGTTCTCCTCAACCTGGTCTATGAAACAACCGA 
CAAGCACCTTTCTCCTGATGGCCAGTACGTCCCCAGAATTGTGTTTGTAGACCCATCCCTGACGGg 
GAGGGCAGACATCACTGGACGATACTCAAACCGGCTCTACGCTTATGAACCTTCTGACACAGCTTT 
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GTTGTACGACAACATGAAGAAAGCTCTCAAGCTGCTAAAGACAGAATTGTAGAGCTAACTGCGGAC 
CGGGTCAGGAGACCAGAAGGCAGAAGCACTGTGGACTTGCAGATTACAGTACAGTTTAATGTTACA 
ACAGATATATTTTTTAAACACCCACAGGTGGGGAAACAATATTATTATCTACTACAGTGAAGCATG 
ATTTTCTAGAAAATAAAGTCTTGTGAGAACTCCAAAAAAAAAAAAAAAAAAAAAA 

5 

Start and stop-codons are underlined. The mutated base is boxed; 
the wild type-sequence carries a T at the boxed position. 

10 SEQ ID NO: 2 amino acid sequence (aa) of mutant Agr2 (mus 
mus cuius) 

MEKFSVSAILLLVAISGTLAKDTTVKSGAKKDPKDSRPKLPQTLSRGWGDQLIWTQTYEEALYRSK 
TSNRPLMVIHHLDECPHSQALKKVFAEHKEIQKLAEQFVLLNLVYETTDKHLSPDGQYVPRIVFV 
15 PSLTgRADITGRYSNRLYAYEPSDTALLYDNMKKALKLLKTEL 

The mutated aa is boxed; the wild type-sequence carries a V at the 
boxed position. 

20 

SEQ ID NO: 3 amino acid sequence (aa) of wild type Agr2 (mus 
mus cuius) 

MEKFSVSAILLLVAISGTLAKDTTVKSGAECKDPKDSRPKLPQTLSRGWGDQLIWTQTYEEALYRSK 
25 TSNRPLMVIHHLDECPHSQALKKVFAEHKEIQKI^QFVLLNLVYETTDKHLSPDGQYVPRIVFVD 
PSLT0RADITGRYSNRLYAYEPSDTALLYDNMKKALKLLKTEL 

The mutated aa is boxed; the mutant-sequence carries an E at the 
boxed position. 

30 

SEQ ID NO: 4 amino acid sequence (aa) of wild type AGR2 (human) 

MEKIPVSAFLLLVALSYTLARDTTVKPGAKKDTKDSRPKLPQTLSRGWGDQLIWTQTYEEALYKSK 
35 TSNKPLMIIHHLDECPHSQALKKVFAENKEIQKLAEQFVLLNLVYETTDKHLSPDGQYVPRIMFVD 
PSLT@RADITGRYSNRLYAYEPADTALLLDNMKKALKLLKTEL 

The aa corresponding to the aa mutated in mouse is boxed; a 
mutant-sequence would carry an E at the boxed position. 

40 

SEQ ID NO: 5 nucleic acid sequence (cDNA) of human AGR2 

CCGCATCCTAGCCGCCGACTCACACAAGGCAGGTGGGTGAGGAAATCCAGAGTTGCCATGGAGAAA 
45 ATTCCAGTGTCAGCATTCTTGCTCCTTGTGGCCCTCTCCTACACTCTGGCCAGAGATACCACAGTC 
AAACCTGGAGCCAAAAAGGACACAAAGGACTCTCGACCCAAACTGCCCCAGACCCTCTCCAGAGGT 
TGGGGTGACCAACTCATCTGGACTCAGACATATGAAGAAGCTCTATATAAATCCAAGACAAGCAAC 
AAACCCTTGATGATTATTCATCACTTGGATGAGTGCCCACACAGTCAAGCTTTAAAGAAAGTGTTT 
GCTGAAAATAAAGAAATCCAGAAATTGGCAGAGCAGTTTGTCCTCCTCAATCTGGTTTATGAAACA 
50 ACTGACAAACACCTTTCTCCTGATGGCCAGTATGTCCCCAGGATTATGTTTGTTGACCCATCTCTG 
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aca^tt]agagccgatatcactggaagatattcaaatcgtctctatgcttacgaacctgcagataca 

GCTCTGTTGCTTGACAACATGAAGAAAGCTCTCAAGTTGCTGAAGACTGAATT GTAAA GAAAAAAA 
ATCTCCAAGCCCTTCTGTCTGTCAGGCCTTGAGACTTGAAACCAGAAGAAGTGTGAGAAGACTGGC 
TAGTGTGGAAGCATAGTGAACACACTGATTAGGTTATGGTTTAATGTTACAACAACTATTTTTTAA 

5 GAAAAACAAGTTTTAGAAATTTGGTTTCAAGTGTACATGTGTGAAAACAATATTGTATACTACCAT 
AGTGAGCCATGATTTTCTAAAAAAAAAAATAAATGTTTTGGGGGTGTTCTGTTTTCTCCAACTTGG 
TCT TTCACAGTGGTTCGT T T ACCAAATAGGAT TAAACACACACAAAATGCTCAAGGAAGGGACAAG 
ACAAAACCAAAACTAGTTCAAATGATGAAGACCAAAGACCAAGTTATCATCTCACCACACCACAGG 
TTCTCACTAGATGACT GTAAGT AGACACGAGCT T AATCAACAGAAGTATCAAGCCATGTGCTT TAG 

10 CATAAAAGAATATTTAGAAAAACATCCCAAGAAAATCACATCACTACCTAGAGTCAACTCTGGCCA 
GGAACTCTAAGGTACACACTTTCATTTAGTAATTAAATTTTAGTCAGATTTTGCCCAACCTAATGC 
TCTCAGGGAAAGCCTCTGGCAAGTAGCTTTCTCCTTCAGAGGTCTAATTTAGTAGAAAGGTCATCC 
AAAGAACATCTGCACTCCTGAACACACCCTGAAGAAATCCTGGGAATTGACCTTGTAATCGATTTG 
TCTGTCAAGGTCCTAAAGTACTGGAGTGAAATAAATTCAGCCAACATGTGACTAATTGGAAGAAGA 

15 GCAAAGGGTGGTGACGTGTTGATGAGGCAGATGGAGATCAGAGGTTACTAGGGTTTAGGAAACGTG 
AAAGGCTGTGGCATCAGGGTAGGGGAGCATTCTGCCTAACAGAAATTAGAATTGTGTGTTAATGTC 
TTCACTCTATACTTAATCTCACATTCATTAATATATGGAATTCCTCTACTGCCCAGCCCCTCCTGA 
TTTCTTTGGCCCCTGGACTATGGTGCTGTATATAATGCTTTGCAGTATCTGTTGCTTGTCTTGATT 
AACTTTTTTGGATAAAACCTTTTTTGAACAGAAAAAAAAAAAAAAAAAAAA 

20 

Start and stop-codons are underlined. The codon encoding valin at 
position 137 of the protein sequence is boxed. The point mutation 
to underline I 

25 

SEQ ID NO: 6 nucleic acid sequence (cDNA) of wild type Agr2 (mus 
musculus; C3H) 

GGCAACCCTTGCGGCTCACACAAAGCAGGAGGGTGGGAAGCCCAGATTTGCCATGGAGAAATTTTC 
30 AGTGTCTGCAATCCTGCTTCTTGTGGCCATTTCTGGTACCTTGGCCAAAGACACCACAGTCAAATC 
TGGAGCCAAAAAGGACCCAAAGGACTCTCGGCCCAAACTACCTCAGACACTCTCCAGAGGT TGGGG 
CGATCAGCTCATCTGGACTCAGACATACGAAGAAGCTTTATACAGATCCAAGACAAGCAACAGACC 
CTTGATGGTCATTCATCACTTGGACGAATGCCCACACAGTCAAGCCTTAAAGAAAGTGTTTGCTGA 
ACATAAAGAAATCCAGAAATTGGCAGAGCAGTTTGTTCTCCTCAACCTGGTCTATGAAACAACCGA 
35 CAAGCACCTTTCTCCTGATGGCCAGTACGTCCCCAGAATTGTGTTTGTAGACCCATCCCTGACG(^ 
GAGGGCAGACATCACTGGACGATACTCAAACCGGCTCTACGCTTATGAACCTTCTGACACAGCTTT 
GTTGTACGACAACATGAAGAAAGCTCTCAAGCTGCTAAAGACAGAATTGTAGAGCTAACTGCGCAC 
CGGGTCAGGAGACCAGAAGGCAGAAGCACTGTGGACTTGCAGATTACAGTACAGTTTAATGTTACA 
AC AGAT AT AT TT TTTAAACACCCACAGGTGGGGAAACAATATTATTATCTACT ACAGTGAAGCATG 
40 ATTTTCTAGAAAATAAAGTCTTGTGAGAACTCCAAAAAAAAAAAAAAAAAAAAAA 

Start and stop-codons are underlined. The mutated base is boxed; 
the mutant- sequence carries an A at the boxed position. 

45 
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SEQ ID NO: 7 mAgr2-7 primer (artificial) 
CAGACCCTTGATGGTCATTC -3' 

SEQ ID NO: 8 mAgr2-2 primer (artificial) 
5'- GTCTCCTGACCCGGTGCGCAG -3' 

SEQ ID NO: 9 hAGR2-l primer (artificial) 
5'- GAACCTGCAGATACAGCTCTG -3' 

SEQ ID NO: 10 hAGR2-4 primer (artificial) 
5'- CACACTAGCCAGTCTTCTCAC -3' 

SEQ ID NO: 11 idb2-SNP-marker (mus musculus) 



CTAAACTGCGTTTCTCTCCCAATCTTTTGCAGGCATTTGGGGACTTTTTCTTTTCTTTTTACTTTC 
TCTTTTTCTTTTGCACAAGAAGAAGTCTACAAGATCTTTTAAGACTTTTGTTATCAGCCATTTCAC 

25 CAGGAGAACACGTTGAATGGACCTTTTTAAAAAGAAAGCGGAAGGAAAACTAAGGATGATCGTCTT 
GCCCAGGTGTCTTGTTCTCCGGCCTGGACTGTGATACCGTTATTTATGAGAGACTTTCAGTGCCCT 
TTCTACAGTTGGAAGGTTTTCTTTATATACTATTCCCACCATGGGGAGCGAAAA_[G/CJ_GTTAAAA 
AAAAAAGAAAAAAATCACAAGGAATTGCCCAATGTAAGCAGACTTTGCCTTTTCACAAAGGTGGAG 
CGTGAATTCCAGAAGGACCCAGTATTCGGTTACTTAAATGAAGTCTTCGGTCAGAAATGGCCTTTT 

30 TGACACGAGCCTACTGAATGCTGTGTATATATTTATATATAAATATATATATATTGAGTGAACCTT 
GTGGACTCTTTAATTAGAGTTTTCTTGTATAGTGGCAGAAATAACCTATTTCTGCATTAAAATGTA 
ATGACGTACTTATGCTAAACTTTTTATAAAAGTTTAGTTGTAAACTTAAC'CCTTTTATACAAAATA 
AATCAAGTGTGTTTATTGAATGTTGATTGCTTGCTTTATTTCAGAC 

35 A SNP position is underlined 

SEQ ID NO: 12 idb2-forward primer (artificial) 
40 5' -CTAAACTGCGTTTCTCTCCCAA-3' 



SEQ ID NO: 13 idb2-reverse primer (artificial) 
45 5 ' -GTCTGAAATAAAGCAAGCAATCAAC- 3 r 

SEQ ID NO: 14 D12Mit64 MIT-marker (mus musculus) 

50 ACGNCTCACTATAGGGCGAATTGGGCCCTCTAGATGCATGCTCGAGNNGGCCGCCAGTGTGCTGGA 
AAGC CTCCTTGAGATCTGAACACTTGTG TGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTA 
TGTATATGTGTATAATTATTATTATTAGGGATTGAATCTAGGTAGACATTCTACCAC AGAGACAAA 
CCACCAGCCCTGCTCCTCAAATCCTTACCTCAATTTCTTTTTTTCTTTTTTTTTGTTTTAACCTTC 



55 



TCTTTTTTTATTAGATATTGTCTTCATTTACATTTCAAATGCTATCCCAAAAG 
Primer positions are underlined 



60 



SEQ ID NO: 15 D12Mit64-f orward primer (artificial) 
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5' -CTCCTTGAGATCTGAACACTTGT-3 ' 

SEQ ID NO: 16 D12Mit 64 -reverse primer (artificial) 
5' -GGGCTGGTGGTTTGTCTCT-3' 
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20 
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30 



35 



40 



45 



50 



55 
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SEQ ID NO: 17 agr2-l primer (artificial) 
5' -GGATAGACCACGGATGGATA-3' 

SEQ ID NO: 18 agr2-2 primer (artificial) 
5' -CCCCAGAGAGAACCTGATTA-3' 

SEQ ID NO: 19 agr2-3 primer (artificial) 
5' -GTTCTCTCTGGGGGCTTTT-3' 

SEQ ID NO: 20 agr2-4 primer (artificial) 
5 ' -AAGATGAGTGAGCCAAACCA- 3 ' 

SEQ ID NO: 21 agr2-5 primer (artificial) 
5' -GGAGTGAAGGCAGTCAACAG-3 ' 

SEQ ID NO: 22 agr2-6 primer (artificial) 
5' -GATGGGACTTGGAGGAGATT-3' 

SEQ ID NO: 23 agr2-7 primer (artificial) 
5' -TCTGTAGCCCCCTCTCTCTT-3' 

SEQ ID NO: 24 agr2-8 primer (artificial) 
5' -CACTAAGTCCCACCGAGAAA-3' 

SEQ ID NO: 25 agr2-9 primer (artificial) 
5' -GCTGGGGTAGGAGAT AGGAG- 3 ' 

SEQ ID NO: 26 agr2-10 primer (artificial) 
5' -ATCTTGCCCAACTTCAGTCA-3' 

SEQ ID NO: 27 agr2-ll primer (artificial) 
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5' -TAAGCAGGAAGCAGGAGAGA- 3 ' 

SEQ ID NO: 28 agr2-12 primer (artificial) 

5 

5' -AATATTGTTTCCCCACCTGT-3' 

10 SEQ ID NO: 29 nucleic acid sequence (cDNA) of mutant human AGR2 

CCGCATCCTAGCCGCCGACTCACACAAGGCAGGTGGGTGAGGAAATCCAGAGTTGCCATGGAGAAA 
ATTCCAGTGTCAGCATTCTTGCTCCTTGTGGCCCTCTCCTACACTCTGGCCAGAGATACCACAGTC 
AAACCTGGAGCCAAAAAGGACACAAAGGACTCTCGACCCAAACTGCCCCAGACCCTCTCCAGAGGT 

15 TGGGGTGACCAACTCATCTGGACTCAGACATATGAAGAAGCTCTATATAAATCCAAGACAAGCAAC 
AAACCCTTGATGATTATTCATCACTTGGATGAGTGCCCACACAGTCAAGCTTTAAAGAT^AGTGTTT 
GCTGAAAATAAAGAAATCCAGAAATTGGCAGAGCAGTTTGTCCTCCTCAATCTGGTTTATGAAACA 
ACTGACAAACACCTTTCTCCTGATGGCCAGTATGTCCCCAGGATTATGTTTGTTGACCCATCTCTG 
ACAj^^AGAGCCGATATCACTGGAAGATATTCAAATCGTCTCTATGCTTACGAACCTGCAGATACA 

20 GCTCTGTTGCTTGACAACATGAAGAAAGCTCTCAAGTTGCTGAAGACTGAATTGTAAAGAAAAAAA 
ATCTCCAAGCCCTTCTGTCTGTCAGGCCTTGAGACTTGAAACCAGAAGAAGTGTGAGAAGACTGGC 
TAGTGTGGAAGCATAGTGAACACACTGAT TAGGT TATGGTT TAATGTTACAACAACTATTTTTTAA 
GAAAAACAAGTTTTAGAAATTTGGTTTCAAGTGTACATGTGTGAAAACAATATTGTATACTACCAT 
AGTGAGCCATGATTTTCTAAAAAAAAAAATAAATGTTTTGGGGGTGTTCTGTTTTCTCCAACTTGG 

25 TCTTTCACAGTGGTTCGTTTACCAAATAGGATTAAACACACACAAAATGCTCAAGGAAGGGACAAG 
ACAAAACCAAAACTAGTTCAAATGATGAAGACCAAAGACCAAGTTATCATCTCACCACACCACAGG 
TTCTCACTAGATGACTGTAAGTAGACACGAGCTTAATCAACAGAAGTATCAAGCCATGTGCTTTAG 
CATAAAAGAATATTTAGAAAAACATCCCAAGAAAATCACATCACTACCTAGAGTCAACTCTGGCCA 
GGAACTCTAAGGTACACACTTTCATTTAGTAATTAAATTTTAGTCAGATTTTGCCCAACCTAATGC 

30 TCTCAGGGAAAGCCTCTGGCAAGTAGCTTTCTCCTTCAGAGGTCTAATTTAGTAGAAAGGTCATCC 
AAAGAACATCTGCACTCCTGAACACACCCTGAAGAAATCCTGGGAATTGACCTTGTAATCGATTTG 
TCTGTCAAGGTCCTAAAGTACTGGAGTGAAATAAATTCAGCCAACATGTGACTAATTGGAAGAAGA 
GCAAAGGGTGGTGACGTGTTGATGAGGCAGATGGAGATCAGAGGTTACTAGGGTTTAGGAAACGTG 
AAAGGCTGTGGCATCAGGGTAGGGGAGCATTCTGCCTAACAGAAATTAGAATTGTGTGTTAATGTC 

35 TTCACTCTATACTTAATCTCACATTCATTAATATATGGAATTCCTCTACTGCCCAGCCCCTCCTGA 
TTTCTTTGGCCCCTGGACTATGGTGC 

TGTATATAATGCTTTGCAGTATCTGTTGCTTGTCTTGATTAACTTTTTTGGATAAAACCTTTTTTG 
AACAGAAAAAAAAAAAAAAAAAAAA 

40 Start and stop-codons are underlined. The codon encoding valin at 
position 137 of the protein sequence is boxed. The codon GAR 
stands for either GAA or GAG, each encoding valin. 

45 SEQ ID NO: 30 amino acid sequence (aa) of human mutant AGR2 
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MEKIPVSAFLLLVALSYTLARDTTVKPGAKKDTKDSRPKLPQTLSRGWGDQLIWTQTYEEALYKSK 
TSNKPLMIIHHLDECPHSQALKKVFAENKEIQKLAEQFVLLNLVYETTDKHLSPDGQYVPRIMFVD 
PSLTgRADITGRYSNRLYAYEPADTALLLDNMKKALKLLKTEL 

5 The aa corresponding to the aa mutated in human is boxed; the wild 
type-sequence carries a V at the boxed position, instead of the E 
indicated. 

10 SEQ ID NO: 31 humanagr2-3 primer (artificial) 
5' -GCCATGGAGAAAATTCCAGTGTC-3' 
SEQ ID NO: 32 humanagr2-4 primer (artificial) 

15 

5'-tttacaattcagtcttcagcaacttg-3' 
SEQ ID NO: 33 PCR product (human) 

20 

CCATGGAGAAAATTCCAGTGTCAGCATTCTTGCTCCTTGTGGCCCTCTCCTACACTCTGGCCAGAG 

ATACCACAGTCAAACCTGGAGCCAAAAAGGACACAAAGGACTCTCGACCCAAACTGCCCCAGACCC 

TCTCCAGAGGTTGGGGTGACCAACTCATCTGGACTCAGACATATGAAGAAGCTCTATATAAATCCA 

AGACAAGCAACAAACCCTT GATGATTATTCATCAC TTGGATGAGTGCCCACACAGTCAAGCT TTAA 

25 AGAAAGTGTTTGCTGAAAATAAAGAAATCCAGAAATTGGCAGAGCAGTTTGTCCTCCTCAATCTGG 

TTTATGAAACAACTGACAAACACCTTTCTCCTGATGGCCAGTATGTCCCCAGGATTATGTTTGTTG 

ACCCATCTCTGACAGTTAGAGCCGATATCACTGGAAGATATTCAAATCGTCTCTATGCTTACGAAC 

CTGCAGATACAGCTCTGTTGCTTGACAACATGAAGAAAGCTCTCAAGTTGCTGAAGACTGAATTGT 
AAA 

30 
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SEQUENCE LISTING JCt 7 HC J Cl fv . P * f* i \J> 23' JUN 

<110> Ingenium Pharmaceuticals AG 

5 <120> Methods and Agents for Diagnosis and Prevention, Amelioration or 
Treatment of Goblet Cell-Related Disorders 

<130> ING10631 

10 <140> US 60/436,322 
<141> 2002-12-23 

<160> 34 

15 <170> Patentln version 3.2 

<210> 1 

<211> 781 

<212> DNA 

20 <213> Mus musculus 

<400> 1 

ggcaaccctt gcggctcaca caaagcagga gggtgggaag cccagatttg ccatggagaa 60 

25 attttcagtg tctgcaatcc tgcttcttgt ggccatttct ggtaccttgg ccaaagacac 120 

cacagtcaaa tctggagcca aaaaggaccc aaaggactct cggcccaaac tacctcagac 180 

actctccaga ggttggggcg atcagctcat ctggactcag acatacgaag aagctttata 240 

cagatccaag acaagcaaca gacccttgat ggtcattcat cacttggacg aatgcccaca 300 

cagtcaagcc ttaaagaaag tgtttgctga acataaagaa atccagaaat tggcagagca 360 

gtttgttctc ctcaacctgg tctatgaaac aaccgacaag cacctttctc ctgatggcca 420 

gtacgtcccc agaattgtgt ttgtagaccc atccctgacg gagagggcag acatcactgg 480 

acgatactca aaccggctct acgcttatga accttctgac acagctttgt tgtacgacaa 540 

catgaagaaa gctctcaagc tgctaaagac agaattgtag agctaactgc gcaccgggtc 600 

aggagaccag aaggcagaag cactgtggac ttgcagatta cagtacagtt taatgttaca 660 

acagatatat tttttaaaca cccacaggtg gggaaacaat attattatct actacagtga 720 

agcatgattt tctagaaaat aaagtcttgt gagaactcca aaaaaaaaaa aaaaaaaaaa 780 
a 



30 



35 



40 



45 



50 

<210> 2 

<211> 175 

<212> PRT 

55 <213> Mus musculus 

<400> 2 

Met Glu Lys Phe Ser Val Ser Ala He Leu Leu Leu Val Ala He Ser 
60 1 5 io is 



781 



65 



Gly Thr Leu Ala Lys Asp Thr Thr Val Lys Ser Gly Ala Lys Lys Asp 
20 25 30 



Pro Lys Asp Ser Arg Pro Lys Leu Pro Gin Thr Leu Ser Arg Gly Trp 
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35 40 45 



Gly Asp Gin Leu lie Trp Thr Gin Thr Tyr Glu Glu Ala Leu Tyr Arg 
5 50 55 60 



Ser Lys Thr Ser Asn Arg Pro Leu Met Val He His His Leu Asp Glu 
65 " 70 75 80 

10 

Cys Pro His Ser Gin Ala Leu Lys Lys Val Phe Ala Glu His Lys Glu 
85 90 95 

15 

He Gin Lys Leu Ala Glu Gin Phe Val Leu Leu Asn Leu Val Tyr Glu 
100 105 HO 



20 Thr Thr Asp Lys His Leu Ser Pro Asp Gly Gin Tyr Val Pro Arg He 
115 120 125 



Val Phe Val Asp Pro Ser Leu Thr Glu Arg Ala Asp He Thr Gly Arg 
25 130 135 140 



Tyr Ser Asn Arg Leu Tyr Ala Tyr Glu Pro Ser Asp Thr Ala Leu Leu 
145 150 155 160 

30 

Tyr Asp Asn Met Lys Lys Ala Leu Lys Leu Leu Lys Thr Glu Leu 
165 170 175 

35 

<210> 3 

<211> 175 

<212> PRT 

<213> Mus musculus 

40 

<400> 3 

Met Glu Lys Phe Ser Val Ser Ala He Leu Leu Leu Val Ala He Ser 
15 10 15 

45 

Gly Thr Leu Ala Lys Asp Thr Thr Val Lys Ser Gly Ala Lys Lys Asp 
20 25 30 

50 

Pro Lys Asp Ser Arg Pro Lys Leu Pro Gin Thr Leu Ser Arg Gly Trp 
35 40 45 



55 Gly Asp Gin Leu He Trp Thr Gin Thr Tyr Glu Glu Ala Leu Tyr Arg 
50 55 60 



Ser Lys Thr Ser Asn Arg Pro Leu Met Val He His His Leu Asp Glu 
60 65 70 75 80 



Cys Pro His Ser Gin Ala Leu Lys Lys Val Phe Ala Glu His Lys Glu 

85 90 95 

65 

He Gin Lys Leu Ala Glu Gin Phe Val Leu Leu Asn Leu Val Tyr Glu 
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100 105 110 



Thr Thr Asp Lys His Leu Ser Pro Asp Gly Gin Tyr Val Pro Arg lie 
5 115 120 125 



Val Phe Val Asp Pro Ser Leu Thr Val Arg Ala Asp lie Thr Gly Arg 
130 135 140 

10 

Tyr Ser Asn Arg Leu Tyr Ala Tyr Glu Pro Ser Asp Thr Ala Leu Leu 
145 150 155 160 

15 

Tyr Asp Asn Met Lys Lys Ala Leu Lys Leu Leu Lys Thr Glu Leu 
165 170 175 



20 <210> 4 

<211> 175 

<212> PRT 

<213> Homo sapiens 

25 <400> 4 

Met Glu Lys lie Pro Val Ser Ala Phe Leu Leu Leu Val Ala Leu Ser 
15 10 15 

30 

Tyr Thr Leu Ala Arg Asp Thr Thr Val Lys Pro Gly Ala Lys Lys Asp 
20 25 30 



35 Thr Lys Asp Ser Arg Pro Lys Leu Pro Gin Thr Leu Ser Arg Gly Trp 
35 40 45 



Gly Asp Gin Leu He Trp Thr Gin Thr Tyr Glu Glu Ala Leu Tyr Lys 
40 50 55 60 



Ser Lys Thr Ser Asn Lys Pro Leu Met He He His His Leu Asp Glu 
65 70 75 80 

45 

Cys Pro His Ser Gin Ala Leu Lys Lys Val Phe Ala Glu Asn Lys Glu 
85 90 95 

50 

He Gin Lys Leu Ala Glu Gin Phe Val Leu Leu Asn Leu Val Tyr Glu 
100 105 110 



55 Thr Thr Asp Lys His Leu Ser Pro Asp Gly Gin Tyr Val Pro Arg He 
115 120 125 



Met Phe Val Asp Pro Ser Leu Thr Val Arg Ala Asp He Thr Gly Arg 
60 130 135 140 



Tyr Ser Asn Arg Leu Tyr Ala Tyr Glu Pro Ala Asp Thr Ala Leu Leu 
145 150 155 160 

65 

Leu Asp Asn Met Lys Lys Ala Leu Lys Leu Leu Lys Thr Glu Leu 
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165 



<210> 5 

5 <211> noi 

<212> DNA 

<213> Homo sapiens 

<400> 5 

10 ccgcatccta gccgccgact cacacaaggc 
gagaaaattc cagtgtcagc attcttgctc 



gataccacag tcaaacctgg agccaaaaag 

15 

cagaccctct ccagaggttg gggtgaccaa 
ctatataaat ccaagacaag caacaaaccc 
20 ccacacagtc aagctttaaa gaaagtgttt 
gagcagtttg tcctcctcaa tctggtttat 
ggccagtatg tccccaggat tatgtttgtt 

25 

actggaagat attcaaatcg tctctatgct 
gacaacatga agaaagctct caagttgctg 
30 aagcccttct gtctgtcagg ccttgagact 
tagtgtggaa gcatagtgaa cacactgatt 
ttttaagaaa aacaagtttt agaaatttgg 

35 

gtatactacc atagtgagcc atgattttct 
ctgttttctc caacttggtc tttcacagtg 
40 caaaatgctc aaggaaggga caagacaaaa 
gaccaagtta tcatctcacc acaccacagg 
gcttaatcaa cagaagtatc aagccatgtg 

45 

tcccaagaaa atcacatcac tacctagagt 



ctttcattta gtaattaaat tttagtcaga 
50 cctctggcaa gtagctttct ccttcagagg 
catctgcact cctgaacaca ccctgaagaa 



tctgtcaagg tcctaaagta ctggagtgaa 

55 

agaagagcaa agggtggtga cgtgttgatg 

ttaggaaacg tgaaaggctg tggcatcagg 

60 gaattgtgtg ttaatgtctt cactctatac 

cctctactgc ccagcccctc ctgatttctt 

gctttgcagt atctgttgct tgtcttgatt 

65 

gaaaaaaaaa aaaaaaaaaa a 



170 175 



aggtgggtga ggaaatccag 


agttgccatg 


60 


cttgtggccc tctcctacac 


tctggccaga 


120 


gacacaaagg actctcgacc 


caaactgccc 


180 


ctcatctgga ctcagacata 


tgaagaagct 


240 


ttgatgatta ttcatcactt 


ggatgagtgc 


300 


gctgaaaata aagaaatcca 


gaaattggca 


360 


gaaacaactg acaaacacct 


ttctcctgat 


420 


gacccatctc tgacagttag 


agccgatatc 


480 


tacgaacctg cagatacagc 


tctgttgctt 


540 


aagactgaat tgtaaagaaa 


aaaaatctcc 


600 


tgaaaccaga agaagtgtga 


gaagactggc 


660 


aggttatggt ttaatgttac 


aacaactatt 


720 


tttcaagtgt acatgtgtga 


aaacaatatt 


780 


aaaaaaaaaa ataaatgttt 


tgggggtgtt 


840 


gttcgtttac caaataggat 


taaacacaca 


900 


ccaaaactag ttcaaatgat 


gaagaccaaa 


960 


ttctcactag atgactgtaa 


gtagacacga 


1020 


ctttagcata aaagaatatt 


tagaaaaaca 


1080 


caactctggc caggaactct 


aaggtacaca 


1140 


ttttgcccaa cctaatgctc 


tcagggaaag 


1200 


tctaatttag tagaaaggtc 


atccaaagaa 


1260 


atcctgggaa ttgaccttgt 


aatcgatttg 


1320 


ataaattcag ccaacatgtg 


actaattgga 


1380 


aggcagatgg agatcagagg 


ttactagggt 


1440 


gtaggggagc attctgccta 


acagaaatta 


1500 


ttaatctcac attcattaat 


atatggaatt 


1560 


tggcccctgg actatggtgc 


tgtatataat 


1620 


aacttttttg gataaaacct 


tttttgaaca 


1680 






1701 



128 



WO 2004/056858 



PCT/EP2003/014834 



<210> 6 

<211> 781 

<212> DNA 

5 <2l'3> Mus musculus 

<400> 6 

ggcaaccctt gcggctcaca caaagcagga gggtgggaag cccagatttg ccatggagaa 60 
10 attttcagtg tctgcaatcc tgcttcttgt ggccatttct ggtaccttgg ccaaagacac 120 
cacagtcaaa tctggagcca aaaaggaccc aaaggactct cggcccaaac tacctcagac 180 
actctccaga ggttggggcg atcagctcat ctggactcag acatacgaag aagctttata 240 

15 

cagatccaag acaagcaaca gacccttgat ggtcattcat cacttggacg aatgcccaca 300 
cagtcaagcc ttaaagaaag tgtttgctga acataaagaa atccagaaat tggcagagca 360 
20 gtttgttctc ctcaacctgg tctatgaaac aaccgacaag cacctttctc ctgatggcca 420 
gtacgtcccc agaattgtgt ttgtagaccc atccctgacg gtgagggcag acatcactgg 480 
acgatactca aaccggctct acgcttatga accttctgac acagctttgt tgtacgacaa 540 

25 

catgaagaaa gctctcaagc tgctaaagac agaattgtag agctaactgc gcaccgggtc 600 
aggagaccag aaggcagaag cactgtggac ttgcagatta cagtacagtt taatgttaca 660 
30 acagatatat tttttaaaca cccacaggtg gggaaacaat attattatct actacagtga 720 
agcatgattt tctagaaaat aaagtcttgt gagaactcca aaaaaaaaaa aaaaaaaaaa 780 
a 781 

35 

<210> 7 
<211> 20 
<212> DNA 
40 <213> artifical 

<220> 

<223> mAgr2-7 primer 
45 <400> 7 

cagacccttg atggtcattc 20 

<210> 8 

50 <211> 21 

<212> DNA 

<213> artifical 

<220> 

55 <223> mAgr2-2 primer 
<400> 8 

gtctcctgac ccggtgcgca g 21 



60 



65 



<210> 9 

<211> 21 

<212> DNA 

<213> artifical 

<220> 

<223> hAgrl primer 
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<400> 9 

gaacctgcag atacagctct g 

5 

<210> 10 

<211> 21 

<212> DNA 

<213> artifical 

10 

<220> 

<223> hAgr4 primer 



15 


<400> 10 

cacactagcc agtcttctca 


c 


20 


<210> 11 
<211> 702 
<212> DNA 
<213> Mus 


musculus 




25 


<220> 

<221> misc feature 
<222> (319) (319) 
<223> n is a, c, g, or t 


OK) 


<400> 11 
ctaaactgcg 


tttctctccc 


aatcttttgc 




actttctctt 


tttcttttgc 


acaagaagaa 


35 


cagccatttc 


accaggagaa 


cacgttgaat 


actaaggatg 


atcgtcttgc 


ccaggtgtct 




tttatgagag 


actttcagtg 


ccctttctac 


40 


caccatgggg 


agcgaaaang 


ttaaaaaaaa 




taagcagact 


ttgccttttc 


acaaaggtgg 


45 


gttacttaaa 


tgaagtcttc 


ggtcagaaat 


gtgtatatat 


ttatatataa 


atatatatat 




gagttttctt 


gtatagtggc 


agaaataacc 


50 


tatgctaaac 


tttttataaa 


agtttagttg 




aagtgtgttt 


attgaatgtt 


gattgcttgc 


55 


<210> 12 
<211> 22 
<212> DNA 







<213> artifical 



60 <220> 

<223> primer 1 Idb2-SNP-marker 

<400> 12 

ctaaactgcg tttctctccc aa 

65 



<210> 13 



21 



21 



aggcatttgg ggactttttc ttttcttttt 60 

gtctacaaga tcttttaaga cttttgttat 120 

ggaccttttt aaaaagaaag cggaaggaaa 180 

tgttctccgg cctggactgt gataccgtta 240 

agttggaagg ttttctttat atactattcc 300 

aagaaaaaaa tcacaaggaa ttgcccaatg 360 

agcgtgaatt ccagaaggac ccagtattcg 420 

ggcctttttg acacgagcct actgaatgct 480 

attgagtgaa ccttgtggac tctttaatta 540 

tatttctgca ttaaaatgta atgacgtact 600 

taaacttaac ccttttatac aaaataaatc 660 

tttatttcag ac 702 



130 



WO 2004/056858 



PCT7EP2003/014834 



10 



30 



65 



<211> 25 

<212> DNA 

<213> artifical 

<220> 

<223> primer 2 Idb2-SNP-marker 

<400> 13 

gtctgaaata aagcaagcaa tcaac 25 



<210> 14 

<211> 317 

<212> DNA 

15 <213> Mus musculus 



<220> 

<221> misc_feature 

20 <222> (4).. (4) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

25 <222> (47).. (48) 

<223> n is a, c, g, or t 



<400> 14 

acgnctcact atagggcgaa ttgggccctc tagatgcatg ctcgagnngg ccgccagtgt 60 



gctggaaagc ctccttgaga tctgaacact tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 120 
tgtgtgtgtg tatgtatatg tgtataatta ttattattag ggattgaatc taggtagaca 180 
35 ttctaccaca gagacaaacc accagccctg ctcctcaaat ccttacctca atttcttttt 240 
ttcttttttt ttgttttaac cttctctttt tttattagat attgtcttca tttacatttc 300 
aaatgctatc ccaaaag 317 

40 

<210> 15 
<211> 23 
<212> DNA 
45 <213> artifical 

<220> 

<223> primer 1 D12Mit64 MIT-marker 
50 <400> 15 

ctccttgaga tctgaacact tgt 23 

<210> 16 

55 <211> 19 

<212> DNA 

<213> artifical 

<220> 

60 <223> primer 2 Dl2Mit64 MIT-marker 
<400> 16 

gggctggtgg tttgtctct 19 



<210> 17 
<211> 20 
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10 



15 



<212> DNA 
<213> artifical 

<220> 

<223> Agr2 primer 1 
<400> 17 

ggatagacca cggatggata 20 



<210> 18 

<211> 20 

<212> DNA 

<213> artifical 



<220> 

<223> Agr2 primer 2 
<400> 18 

20 ccccagagag aacctgatta 20 



<210> 19 

<211> 19 

25 <212> DNA 

<213> artifical 



<220> 

<223> Agr2 primer 3 

30 

<400> 19 

gttctctctg ggggctttt 19 

<210> 20 

35 <211> 20 

<212> DNA . 

<213> artifical 



<220> 

40 <223> Agr2 primer 4 

<400> 20 

aagatgagtg agccaaacca 20 

45 

<210> 21 

<211> 20 

<212> DNA 

<213> artifical 

50 

<220> 

<223> Agr2 primer 5 

<400> 21 

55 ggagtgaagg cagtcaacag 20 



<210> 22 

<211> 20 

60 <212> DNA 

<213> artifical 



<220> 

<223> Agr2 primer 6 

<400> 22 i 

gatgggactt ggaggagatt 20 
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<210> 23 

<211> 20 

5 <212> DNA 

<213> artifical 

<220> 

<223> Agr2 primer 7 

10 

<400> 23 

tctgtagccc cctctctctt 20 



15 <210> 24 

<211> 20 

<212> DNA 

<213> artifical 

20 <220> 

<223> Agr2 primer 8 



<210> 25 

<211> 20 

<212> DNA 

30 <213> artifical 

<220> 

<223> Agr2 primer 9 

35 <400> 25 

gctggggtag gagataggag 20 



<210> 26 

40 <2U> 20 

<212> DNA 

<213> artifical 

<220> 

45 <223> Agr2 primer 10 



<400> 24 



cactaagtcc caccgagaaa 



20 



25 



<400> 26 

atcttgccca acttcagtca 



20 



50 



<210> 
<211> 
<212> 
<213> 



artifical 



27 
20 
DNA 



55 



<220> 
<223> 



Agr2 primer 11 



60 



<400> 27 

taagcaggaa gcaggagaga 



20 



65 



<210> 28 
<211> 20 
<212> DNA 



<213> artifical 
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<220> 

<223> Agr2 primer 12 

<400> 28 

5 aatattgttt ccccacctgt 20 

<210> 29 

<211>- 1701 

10 <212> DNA 

<213> Homo sapiens 



15 



20 



30 



35 



40 



45 



50 



55 



60 



65 



<400> 29 

ccgcatccta gccgccgact cacacaaggc aggtgggtga ggaaatccag agttgccatg 60 



gagaaaattc cagtgtcagc attcttgctc cttgtggccc tctcctacac tctggccaga 120 

gataccacag tcaaacctgg agccaaaaag gacacaaagg actctcgacc caaactgccc 180 

cagaccctct ccagaggttg gggtgaccaa ctcatctgga ctcagacata tgaagaagct 240 

ctatataaat ccaagacaag caacaaaccc ttgatgatta ttcatcactt ggatgagtgc 300 

^ ccacacagtc aagctttaaa gaaagtgttt gctgaaaata aagaaatcca gaaattggca 360 

gagcagtttg tcctcctcaa tctggtttat gaaacaactg acaaacacct ttctcctgat 420 

ggccagtatg tccccaggat tatgtttgtt gacccatctc tgacagarag agccgatatc 480 

actggaagat attcaaatcg tctctatgct tacgaacctg cagatacagc tctgttgctt 540 

gacaacatga agaaagctct caagttgctg aagactgaat tgtaaagaaa aaaaatctcc 600 

aagcccttct gtctgtcagg ccttgagact tgaaaccaga agaagtgtga gaagactggc 660 

tagtgtggaa gcatagtgaa cacactgatt aggttatggt ttaatgttac aacaactatt 720 

ttttaagaaa aacaagtttt agaaatttgg tttcaagtgt acatgtgtga aaacaatatt 780 

gtatactacc atagtgagcc atgattttct aaaaaaaaaa ataaatgttt tgggggtgtt 840 

ctgttttctc caacttggtc tttcacagtg gttcgtttac caaataggat taaacacaca 900 

caaaatgctc aaggaaggga caagacaaaa ccaaaactag ttcaaatgat gaagaccaaa 960 

gaccaagtta tcatctcacc acaccacagg ttctcactag atgactgtaa gtagacacga 1020 
gcttaatcaa cagaagtatc aagccatgtg ctttagcata aaagaatatt tagaaaaaca 



1080 

tcccaagaaa atcacatcac tacctagagt caactctggc caggaactct aaggtacaca 1140 

ctttcattta gtaattaaat tttagtcaga ttttgcccaa cctaatgctc tcagggaaag 1200 

cctctggcaa gtagctttct ccttcagagg tctaatttag tagaaaggtc atccaaagaa 1260 

catctgcact cctgaacaca ccctgaagaa atcctgggaa ttgaccttgt aatcgatttg 1320 

tctgtcaagg tcctaaagta ctggagtgaa ataaattcag ccaacatgtg actaattgga 1380 

agaagagcaa agggtggtga cgtgttgatg aggcagatgg agatcagagg ttactagggt 1440 

ttaggaaacg tgaaaggctg tggcatcagg gtaggggagc attctgccta acagaaatta 1500 

gaattgtgtg ttaatgtctt cactctatac ttaatctcac attcattaat atatggaatt 1560 

cctctactgc ccagcccctc ctgatttctt tggcccctgg actatggtgc tgtatataat 1620 
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gctttgcagt atctgttgct tgtcttgatt aacttttttg gataaaacct tttttgaaca 1680 
gaaaaaaaaa aaaaaaaaaa a 1701 

5 

<210> 30 

<2U> 175 

<212> PRT 

<213> Homo sapiens 

10 

<400> 30 

Met Glu Lys lie Pro Val Ser Ala Phe Leu Leu Leu Val Ala Leu Ser 
15 10 15 

15 

Tyr Thr Leu Ala Arg Asp Thr Thr Val Lys Pro Gly Ala Lys Lys Asp 
20 25 30 

20 

Thr Lys Asp Ser Arg Pro Lys Leu Pro Gin Thr Leu Ser Arg Gly Trp 
35 40 45 

25 Gly Asp Gin Leu lie Trp Thr Gin Thr Tyr Glu Glu Ala Leu Tyr Lys 
50 55 60 

Ser Lys Thr Ser Asn Lys Pro Leu Met He He His His Leu Asp Glu 
30 65 70 75 80 

Cys Pro His Ser Gin Ala Leu Lys Lys Val Phe Ala Glu Asn Lys Glu 
85 90 95 

35 

He Gin Lys Leu Ala Glu Gin Phe Val Leu Leu Asn Leu Val Tyr Glu 
100 105 110 

40 

Thr Thr Asp Lys His Leu Ser Pro Asp Gly Gin Tyr Val Pro Arg He 
115 120 125 

45 Met Phe Val Asp Pro Ser Leu Thr Glu Arg Ala Asp He Thr Gly Arg 
130 135 140 

Tyr Ser Asn Arg Leu Tyr Ala Tyr Glu Pro Ala Asp Thr Ala Leu Leu 
50 145 150 155 160 

Leu Asp Asn Met Lys Lys Ala Leu Lys Leu Leu Lys Thr Glu Leu 
165 170 175 

55 



<210> 


31 


<211> 


23 


<212> 


DNA 


<213> 


artifical 


<220> 




<223> 


hAgr2-3 primer 


<400> 


31 



gccatggaga aaattccagt gtc 23 
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<210> 32 

<211> 26 

<212> DNA 

5 <213> artifical 

<220> 

<223> nAgr2-4 primer 1 
10 <400> 32 

tttacaattc agtcttcagc aacttg 26 

<210> 33 

15 <211> 531 

<212> DNA 

<213> artifical 

<220> 

20 <223> PCR product of hAgr2-3 and hAgr2-4 
<400> 33 

ccatggagaa aattccagtg tcagcattct tgctccttgt ggccctctcc tacactctgg 60 

25 ccagagatac cacagtcaaa cctggagcca aaaaggacac aaaggactct cgacccaaac 120 

tgccccagac cctctccaga ggttggggtg accaactcat ctggactcag acatatgaag 180 

aagctctata taaatccaag acaagcaaca aacccttgat gattattcat cacttggatg 240 

30 

agtgcccaca cagtcaagct ttaaagaaag tgtttgctga aaataaagaa atccagaaat . 300 

tggcagagca gtttgtcctc ctcaatctgg tttatgaaac aactgacaaa cacctttctc 360 

35 ctgatggcca gtatgtcccc aggattatgt ttgttgaccc atctctgaca gttagagccg 420 

atatcactgg aagatattca aatcgtctct atgcttacga acctgcagat acagctctgt 480 

tgcttgacaa catgaagaaa gctctcaagt tgctgaagac tgaattgtaa a 531 



40 

<210> 34 

<211> 19 

<212> PRT 

45 <213> artifical 

<220> 

<223> Agr2 epitope exon 11 
50 <400> 34 

Thr Val Lys Ser Gly Ala Lys Lys Asp Pro Lys Asp, Ser Arg Pro Lys 
15 10 15 



55 



Leu Pro Gin 
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An isolated protein having at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 
98%, or 99% amino acid identity compared to the mouse Agr2 or the 
human AGR2 protein according to SEQ ID NO:3 and SEQ ID NO:4, 
respectively, or an isolated fragment of such protein comprising at least 6, 
7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 
140, 150, 160, 165, 170, 171, 172, 173, or 174 contiguous amino acids 
having said percentages of amino acid identity compared to the 
corresponding amino acids in SEQ ID NO:3 and SEQ ID NO:4, wherein 
said protein or fragment of such protein comprises an amino acid or an 
amino acid sequence which corresponds to 

(a) a mutation in the mouse Agr2 protein as defined above which, if 
encoded by the mouse Agr2 gene and present in the genome of all 
or essentially all cells of a mouse in a homozygous manner, results 
in a phenotype associated with an alteration in goblet cell function 
compared to the corresponding wild-type animal; and/or 

(b) a mutation in the mouse Agr2 protein or the human AGR2 protein 
as defined above which leads to an altered biological activity of the 
mutated protein when compared to the corresponding wild-type 
mouse Agr2 protein or human AGR2 protein in an in vitro assay 
selected from the group consisting of a colon cell proliferation 
assay, a goblet cell mucus secretion assay, and a Xenopus laevis 
cement gland differentiation assay; and/or 

(c) a mutation of the human AGR2 protein as defined above which is 
indicative of an increased risk of a human subject of developing a 
medical condition associated with an alteration in goblet cell 
function, or indicative of an association of a medical condition in a 
human subject which is associated with an alteration in goblet cell 
function with altered AGR2 expression or function. 

The isolated protein or protein fragment according to claim 1, wherein said 
protein represents an orthologue of the mouse Agr2 or the human AGR2 
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protein, preferably a vertebrate orthologue, in particular an orthologue 
wherein said vertebrate is Xenopus leavis, or a mammalian orthologue, in 
particular an orthologue wherein said vertebrate is selected from the group 
consisting of a mouse, rat, rabbit, hamster, dog, cat, sheep, and horse. 

3. The isolated protein or protein fragment according to claims 1 or 2, 
wherein said alteration results in a loss of function phenotype. 

4. The isolated protein or protein fragment according to claims 1 or 2, 
wherein said alteration results in a gain of function phenotype. 

5. The isolated protein or protein fragment according to any one of claims 1 
to 4, wherein said alteration is an alteration in goblet cell differentiation, 
particularly terminal differentiation and/or goblet cell mucus production or 
secretion and/or mucus composition. 

6. The isolated protein or protein fragment according to any one of claims 1 
to 3 or 5, wherein said alteration is characterized by a reduction in pre- 
mucin storing granules in the goblet cells, an altered mucus secretion, 
secondary inflammatory infiltrations in the intestinal mucosal epithelium 
and submucosa. 

7. The isolated protein or protein fragment according to any one of claims 1 
to 3 or 5 to 6, wherein said phenotype is furthermore associated with an 
increased proliferation of the glandular epithelium of the Brunner's gland. 

8. The isolated protein or protein fragment according to any one of claims 1 
to 3 or 5 to 7, wherein said alteration results in diarrhea, or diarrhea and a 
thriving deficit. 

9. The isolated protein or protein fragment according to any one of claims 1 
to 4, wherein said medical condition is selected from the group consisting 
of asthma, chronic obstructive pulmonary disease (COPD), cystic fibrosis, 
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dry eye syndrome, gastric disease, peptic ulcer, inflammatory bowel 
disease, in particular Crohn's disease or ulcerative colitis, and intestinal 
cancer. 

5 10. The isolated protein or protein fragment according to any one of claims 1 
to 9, wherein said mutation results in a deletion or substitution by another 
amino acid of an amino acid of said mouse Agr2 protein or human AGR2 
protein, or an insertion of additional amino acids not normally present in 
the amino acid sequence of said mouse Agr2 protein or said human AGR2 

10 protein. 

11. The isolated protein or protein fragment according to claim 10, wherein 
said deletion, substitution, or insertion occurs in an evolutionary conserved 
region of said mouse Agr2 protein or said human AGR2 protein. 

15 

12. The isolated protein or protein fragment according to claims 10 or 11, 
wherein said mutation results in the substitution of an amino acid which is 
identical or similar between mouse, rat, and human AGR2, preferably 
between mouse, rat, human, and Xenopus laevis AGR2, more preferably 

20 between mouse, rat, human, Xenopus laevis, and Caenorhabditis elegans 

AGR2, by another amino acid. 

13. The isolated protein or protein fragment according to any one of claims 10 
to 12, wherein the substitution of said amino acid of said mouse Agr2 

25 protein or said human AGR2 protein by another amino acid is a non- 

conservative substitution. 

14. The isolated protein or protein fragment according to any one of claims 10 
to 13, wherein the amino acid of said mouse Agr2 protein or said human 

30 AGR2 protein that is deleted or substituted is Val 1 37. 

1 5. The isolated protein or protein fragment according to claim 14, wherein the 
substitution at position 137 is one of the following substitutions: 

139 



a) Val -> acidic amino acid such as Glu or Asp; 

b) Val -> basic amino acid, such as His, Arg or Lys; 

c) Val -> aliphatic hydroxyl side chain amino acid, such as Ser or 
Thr; 

d) Val -> amide side chain amino acid, such as Asn or Gin; 

e) Val -> sulfur containing side chain amino acid, such as Cys or Met; 

f) Val -> aromatic side chain amino acid, such as Phe, Tyr, Trp; 

g) Val -> Gly or Pro; and 

h) Val Ala, Leu or He. 

The isolated protein or protein fragment according to claim 15, wherein the 
substitution at position 137 is a substitution of valine by glutamic acid. 

The isolated protein or protein fragment according to any one of claims 10 
to 15, wherein said amino acid is substituted by a naturally occurring 
amino acid. 

An isolated protein having the amino acid sequence set forth in SEQ ID 
NO:2 or SEQ ID NO:30, or an isolated fragment of such protein 
comprising at least 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 
100, 110, 120, 130, 140, 150, 160, 165, 170, 171, 172, 173, or 174 
contiguous amino acids of said amino acid sequence, said contiguous 
amino acids comprising an amino acid corresponding to Glu 137. 

A fusion protein comprising a protein or protein fragment according to any 
one of claims 1 to 18 fused to another protein or protein fragment not 
having said percentages of amino acid sequence identity to any 
corresponding amino acids in SEQ ID NO:3 and SEQ ID NO:4. 

The fusion protein of claim 19, wherein said other protein is a protein 
unrelated to the mouse Agr2 or the human AGR2 protein according to 
SEQ ID NO:3 and SEQ ID NO:4, respectively. 
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2 1 . The fusion protein according to claims 1 9 or 20, wherein said other protein 
is selected from the group consisting of glutathione-S-transferase, an 
immunoglobulin peptide, a polyhistidine peptide, a FLAG tag, and 
streptavidin. 

5 

22. An isolated nucleic acid encoding a protein or a fragment of such protein 
according to any one of claims 1 to 18, or an isolated nucleic acid which is 
complementary thereto. 

10 23. An isolated nucleic acid having the nucleotide sequence set forth in SEQ 
ID NO:l or SEQ ID NO:29, or an isolated nucleic acid which is 
complementary thereto. 

24. An episomal element comprising a nucleic acid as defined in any one of 
15 claims 22 or 23. 

25. The episomal element according to claim 24, wherein said episomal 
element is selected from a plasmid, a cosmid, a bacterial phage nucleic 
acid, or a viral nucleic acid. 

20 

26. A genome comprising a nucleic acid as defined in any one of claims 22 or 
23. 

27. The genome according to claim 26, wherein said genome is a 
25 bacteriophage genome, a bacteria genome, or a virus genome. 

28. The genome of claim 27, wherein said virus genome is a DNA viral 
genome or an RNA viral genome. 

30 29. A vector comprising a nucleic acid molecule encoding the protein 
according to any one of claims 1 to 21. 
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30. The vector according to claim 29, wherein said vector is selected from the 
group consisting of an expression vector, a mutagenesis vector, an 
integration vector and a mutation vector. 

5 31. The vector according to claim 30, wherein said vector is an expression 
vector and wherein the sequence encoding said protein is operably linked 
to a promoter sequence. 

32. The vector according to claims 30 or 31, wherein said vector is an 
10 expression vector and is selected from the group consisting of a plasmid 

vector, a cosmid vector, a phage vector, a phagemid vector, a viral vector, 
and a retroviral vector. 



33. A host cell transfected with the episomal element, genome, or vector of 
15 any one of claims 24 to 32. 

34. The host cell according to claim 33, wherein said host cell is a eukaryotic 
cell. 



20 35. The host cell according to claim 33, wherein said host cell is a prokaryotic 
cell. 



36. An antisense nucleic acid comprising a nucleotide sequence which is 
complementary to 

25 (i) a part of an mRNA encoding a protein according to any one of 

claims 1 to 21, said part encoding an amino acid sequence 
comprising the amino acid or amino acid sequence which 
corresponds to 

(a) the mutation in the mouse Agr2 protein according to SEQ 
30 ID NO:3 which, if encoded by the mouse Agr2 gene and 

present in the genome of all or essentially all cells of a 
mouse in a homozygous manner, results in a phenotype 
associated with an alteration in goblet cell function 
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compared to the corresponding wild-type animal, said 
phenotype optionally being furthermore associated with an 
increased proliferation of the glandular epithelium of the 
Brunner's gland; and/or 

(b) the mutation in the mouse Agr2 protein or the human AGR2 
protein according to SEQ ID NO:3 and SEQ ID NO:4, 
respectively, which leads to an altered biological activity of 
the mutated protein when compared to the corresponding 
wild-type mouse Agr2 protein or human AGR2 protein in 
an in vitro assay selected from the group consisting of a 
colon cell proliferation assay, a goblet cell mucus secretion 
assay, and a Xenopus laevis cement gland differentiation 
assay; and/or 

(c) the mutation of the human AGR2 protein according to SEQ 
ID NO:4 which is indicative of an increased risk of a human 
subject of developing a medical condition associated with 
an alteration in goblet cell function, or indicative of an 
association of a medical condition in a human subject which 
is associated with an alteration in goblet cell function with 
altered AGR2 expression or function; 

a part of the niRNA encoding the mouse Agr2 or the human AGR2 
protein according to SEQ ID NO:3 and SEQ ID NO:4, 
respectively, or an orthologue thereof having at least 65%, 70%, 
75%, 80%, 85%, 90%, 95%, 98%, or 99% amino acid identity 
compared to the mouse Agr2 or the human AGR2 protein as 
defined above, said part being a non-coding part and comprising a 
sequence corresponding to a mutation in the gene coding for said 
protein or orthologue which affects expression of said protein or 
orthologue; or 

a part of the raRNA encoding a protein which affects expression or 
function of the mouse Agr2 or the human AGR2 protein according 
to SEQ ID NO:3 and SEQ ID NO:4, respectively, or an orthologue 
thereof having at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 
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98%, or 99% amino acid identity compared to the mouse Agr2 or 
the human AGR2 protein according to SEQ ID NO:3 and SEQ ID 
NO:4, respectively. 

37. The antisense nucleic acid of claim 36, wherein said antisense nucleic acid 
is selected from the group consisting of DNA, RNA, and a synthetic 
nucleic acid analog, such as a PNA (peptide nucleic acid). 

38. The antisense nucleic acid of claims 36 or 37, wherein said antisense 
nucleic acid is capable of hybridizing to said mRNA via said 
complementary nucleotide sequence under physiological conditions, or 
under conditions of high stringency, preferably under hybridization 
conditions of a high salt buffer comprising 6x SSC, 50 mM Tris-HCl (pH 
7.5), 1 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA, and 500 mg/ml 

15 denatured salmon sperm DNA at 65°C, followed by one or more washes in 

0.2x SSC, 0.01% BSA at 50°C, furthermore preferably under hybridization 
conditions of a high salt buffer comprising 6x SSC, 50 mM Tris-HCl (pH 
7.5), 1 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA, and 500 mg/ml 
denatured salmon sperm DNA at 65°C, followed by one or more washes in 

20 0.2x SSC, 0.01% BSA at 65°C. 

39. The antisense nucleic acid of any one of claims 36 to 38, wherein said 
antisense nucleic acid is a ribozyme and further comprises a catalytic 
region. 

25 

40. The antisense nucleic acid according to claim 39, wherein said catalytic 
region is capable of cleaving said mRNA. 

41. The antisense nucleic acid of any one of claims 38 to 40, wherein said 
30 hybridization to said mRNA is more effective than hybridization to 

(i) the mRNA encoding the same protein which, however, corresponds 
to the wild-type mouse Agr2 or human AGR2 protein according to 
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SEQ ID NO:3 and SEQ ID N0:4 in respect of said amino acid 
sequence; 

(ii) the mRNA encoded by the wild-type gene of the mouse Agr2 or 
human AGR2 protein as defined above, or the wild-type gene of the 
corresponding orthologue; or 

(iii) the mRNA encoded by the wild-type gene of the corresponding 
protein which affects expression or function of the mouse Agr2 or 
the human AGR2 protein as defined above. 

42. A host cell transformed with an antisense nucleic acid according to any 
one of claims 36 to 41 . 

43. The host cell according to claim 42, wherein said host cell is a eukaryotic 
cell. 

44. The host cell according to claim 42, wherein said host cell is a prokaryotic 
ceU. 

45. A short interfering RNA (siRNA) comprising a double stranded nucleotide 
sequence wherein one strand is complementary to an at least 19, 20, 21, 
22, 23, 24, or 25 nucleotide long segment of an mRNA encoding 

(a) the mouse Agr2 or the human AGR2 protein according to SEQ ID 
NO:3 and SEQ ID NO:4, respectively, or an orthologue thereof 
having at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 
99% amino acid identity compared to the mouse Agt2 or the 
human AGR2 protein according to SEQ ID NO:3 and SEQ ID 
NO:4, respectively; or 

(b) a protein which affects expression or function of the mouse Agr2 or 
the human AGR2 protein according to SEQ ID NO:3 and SEQ ID 
NO:4, respectively, or an orthologue thereof having at least 65%, 
70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99% amino acid 
identity compared to the mouse Agr2 or the human AGR2 protein 
according to SEQ ID NO:3 and SEQ ID NO:4, respectively. 
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46. The siRNA of claim 45, wherein said siRNA is capable of silencing or 
suppressing the expression of the AGR2 gene encoding said mRNA. 

5 47. The siRNA of claims 45 or 46, wherein said AGR2 gene is a vertebrate 
AGR2 gene, in particular a Xenopus leavis AGR2 gene, or a mammalian 
AGR2 gene, in particular an AGR2 gene selected from the group 
consisting of the AGR2 gene of human, mouse, rat, rabbit, hamster, dog, 
cat, sheep, and horse, most preferably a mouse Agr2 gene or a human 

10 AGR2 gene. 



48. The siRNA of any one of claims 45 to 47, wherein said AGR2 gene is an 
AGR2 gene of a human subject unaffected by or known not to be at risk of 
developing a condition associated with an alteration in goblet cell function. 

15 

49. A short interfering RNA (siRNA) comprising a double stranded nucleotide 
sequence wherein one strand is complementary to an at least 19, 20, 21, 
22, 23, 24, or 25 nucleotide long segment of an mRNA encoding 

(i) a protein according to any one of claims 1 to 21, said segment 
20 encoding an amino acid sequence comprising the amino acid or 

amino acid sequence which corresponds to 

(a) the mutation in the mouse Agr2 protein according to SEQ 
ID NO:3 which, if encoded by the mouse Agr2 gene and 
present in the genome of all or essentially all cells of a 

25 mouse in a homozygous manner, results in a phenotype 

associated with an alteration in goblet cell function 
compared to the corresponding wild-type animal, said 
phenotype optionally being furthermore associated with an 
increased proliferation of the glandular epithelium of the 

30 Brunner's gland; and/or 

(b) the mutation in the mouse Agr2 protein or the human AGR2 
protein according to SEQ ID NO:3 and SEQ ID NO:4, 
respectively, which leads to an altered biological activity of 
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the mutated protein when compared to the corresponding 
wild-type mouse Agr2 protein or human AGR2 protein in 
an in vitro assay selected from the group consisting of a 
colon cell proliferation assay, a goblet cell mucus secretion 
assay, and a Xenopus laevis cement gland differentiation 
assay; and/or 

(c) the mutation of the human AGR2 protein according to SEQ 
ID NO:4 which is indicative of an increased risk of a human 
subject of developing a medical condition associated with 
an alteration in goblet cell function, or indicative of an 
association of a medical condition in a human subject which 
is associated with an alteration in goblet cell function with 
altered AGR2 expression or function; 

(ii) the mouse Agr2 or the human AGR2 protein according to SEQ ID 
NO:3 and SEQ ID NO:4, respectively, or an orthologue thereof 
having at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 
99% amino acid identity compared to the mouse Agr2 or the 
human AGR2 protein as defined above, said segment being a non- 
coding segment and comprising a sequence corresponding to a 
mutation in the gene coding for said protein or orthologue which 
affects expression of said protein or orthologue; or 

(iii) a protein which affects expression or function of the mouse Agr2 
or the human AGR2 protein according to SEQ ID NO:3 and SEQ 
ID NO:4, respectively, or an orthologue thereof having at least 
65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99% amino acid 
identity compared to the mouse Agr2 or the human AGR2 protein 
according to SEQ ID NO:3 and SEQ ID NO:4, respectively. 

The siRNA according to any one of claims 45 to 49, wherein said segment 
includes sequences from the 5' untranslated (UT) region, the open reading 
frame (ORF), or the 3' UT region of said mRNA. 
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51 . A host cell transformed with an siRNA according to any one of claims 45 
to 50, 

52. The host cell according to claim 51, wherein said host cell is a eukaryotic 
5 cell. 

53. The host cell according to claim 51, wherein said host cell is a prokaryotic 
cell. 

10 54. An antibody specifically recognizing an epitope in a protein according to 
any one of claims 1 to 21, wherein said epitope comprises the amino acid 
or the amino acid sequence in said protein which corresponds to 

(a) the mutation in the mouse Agr2 protein according to SEQ ID NO:3 
which, if encoded by the mouse Agr2 gene and present in the 

15 genome of all or essentially all cells of a mouse in a homozygous 

manner, results in a phenotype associated with an alteration in 
goblet cell function compared to the corresponding wild-type 
animal, said phenotype optionally being furthermore associated 
with an increased proliferation of the glandular epithelium of the 

20 Brunner's gland; and/or 

(b) the mutation in the mouse Agr2 protein or the human AGR2 protein 
according to SEQ ID NO:3 and SEQ ID NO:4, respectively, which 
leads to an altered biological activity of the mutated protein when 
compared to the corresponding wild-type mouse Agr2 protein or 

25 human AGR2 protein in an in vitro assay selected from the group 

consisting of a colon cell proliferation assay, a goblet cell mucus 
secretion assay, and zXenopus laevis cement gland differentiation 
assay; and/or 

(c) the mutation of the human AGR2 protein according to SEQ ID 
30 NO:4 which is indicative of an increased risk of a human subject of 

developing a medical condition associated with an alteration in 
goblet cell function, or indicative of an association of a medical 
condition in a human subject which is associated with an alteration 
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in goblet cell function with altered AGR2 expression or function; 
and/or 

(d) Glul37inSEQIDNO:2andSEQIDNO:30. 

5 55. The antibody according to claim 54, wherein said antibody is a high 
affinity antibody. 

56. The antibody according to claims 54 or 55, wherein the antibody is 
selected from the group consisting of a polyclonal antibody, a monoclonal 

10 antibody, a chimeric antibody, a single-chain antibody, and an F^, F^s or 

F(ab')2 fragment. 

57. The antibody according to any one of claims 54 to 56, wherein said 
antibody is of a class selected from IgG, IgM, IgA, IgE, and IgD. 

15 

58. The antibody according to claim 57, wherein said antibody is of the class 
IgGi or IgG 2 . 

59. The antibody according to any one of claims 54 to 58, wherein said 
20 antibody is a humanized antibody or a human antibody. 

60. The antibody according to any one of claims 54 to 59, wherein said 
antibody is a bispecific antibody, preferably an antibody wherein one of 
the binding specificities is for said epitope and the other binding specificity 

25 is for a cell-surface protein, such as a cell surface receptor or a cell surface 

receptor subunit. 

61. The antibody according to any one of claims 54 to 59, wherein said 
antibody is covalently joined to another antibody to form a heteroconjugate 

30 antibody. 

62. The antibody according to any one of claims 54 to 61, wherein said 
antibody is modified regarding its effector function. 
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An immunoconjugate comprising an antibody according to any one of 
claims 54 to 62 conjugated to 

(a) a cytotoxic agent; 

(b) a receptor or ligand capable of interacting with a cytotoxic agent or 
with a ligand or receptor bound to a cytotoxic agent; or 

(c) to an imaging agent. 

The immunoconjugate according to claim 63, wherein the cytotoxic agent 
is selected from a chemotherapeutic agent, a toxin, or a radioactive isotope. 

The immunoconjugate according to claims 63 or 64, wherein said receptor 
is streptavidin and said ligand bound to the cytotoxic agent is avidin. 

The immunoconjugate according to claim 63, wherein the imaging agent is 
a radioactive isotope, preferably a radioactive isotope selected from 18 F, 
"Cu, 67 Ga, 68 Ga, 99m Tc, in In, 123 I, 125 I, 131 I, 169 Yb, 186 Re, and 201 T1, 
preferably 99m Tc. 

The immunoconjugate according to claim 63 or 66, wherein said imaging 
agent is complexed to a chelating group which is covalently attached to the 
antibody. 

An anticalin specifically binding an epitope in a protein according to any 
one of claims 1 to 21, wherein said epitope comprises the amino acid or 
the amino acid sequence in said protein which corresponds to 
(a) the mutation in the mouse Agr2 protein according to SEQ ID NO:3 
which, if encoded by the mouse Agr2 gene and present in the 
genome of all or essentially all cells of a mouse in a homozygous 
manner, results in a phenotype associated with an alteration in 
goblet cell function compared to the corresponding wild-type 
animal, said phenotype optionally being furthermore associated 
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with an increased proliferation of the glandular epithelium of the 
Brunner's gland; and/or 

(b) the mutation in the mouse Agr2 protein or the human AGR2 protein 
according to SEQ ID NO:3 and SEQ ED NO:4, respectively, which 
leads to an altered biological activity of the mutated protein when 
compared to the corresponding wild-type mouse Agr2 protein or 
human AGR2 protein in an in vitro assay selected from the group 
consisting of a colon cell proliferation assay, a goblet cell mucus 
secretion assay, and a Xenopus laevis cement gland differentiation 
assay; and/or 

(c) the mutation of the human AGR2 protein according to SEQ ID 
NO:4 which is indicative of an increased risk of a human subject of 
developing a medical condition associated with an alteration in 
goblet cell function, or indicative of an association of a medical 
condition in a human subject which is associated with an alteration 
in goblet cell function with altered AGR2 expression or function; 
and/or 

(d) GIu 137 in SEQ ID NO:2 and SEQ ID NO:30. 

An anticalin specifically binding an epitope in a protein which corresponds 
to 

(a) the mouse Agr2 or the human AGR2 protein according to SEQ ID 
NO:3 and SEQ ID NO:4, respectively, or an orthologue thereof 
having at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 
99% amino acid identity compared to the mouse Agr2 or the 
human AGR2 protein according to SEQ ID NO:3 and SEQ ID 
NO:4, respectively; or 

(b) a protein which affects expression or function of the mouse Agr2 
or the human AGR2 protein according to SEQ ID NO:3 and SEQ 
ID NO:4, respectively, or an orthologue thereof having at least 
65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99% amino acid 
identity compared to the mouse Agr2 or the human AGR2 protein 
according to SEQ ID NO:3 and SEQ ID NO:4, respectively. 



An aptamer specifically binding an epitope in a protein according to any 
one of claims 1 to 21, wherein said epitope comprises the amino acid or 
the amino acid sequence in said protein which corresponds to 

(a) the mutation in the mouse Agr2 protein according to SEQ ID NO:3 
which, if encoded by the mouse Agr2 gene and present in the 
genome of all or essentially all cells of a mouse in a homozygous 
manner, results in a phenotype associated with an alteration in 
goblet cell function compared to the corresponding wild-type 
animal, said phenotype optionally being furthermore associated 
with an increased proliferation of the glandular epithelium of the 
Brunner's gland; and/or 

(b) the mutation in the mouse Agr2 protein or the human AGR2 protein 
according to SEQ ID NO:3 and SEQ ID NO:4, respectively, which 
leads to an altered biological activity of the mutated protein when 
compared to the corresponding wild-type mouse Agr2 protein or 
human AGR2 protein in an in vitro assay selected from the group 
consisting of a colon cell proliferation assay, a goblet cell mucus 
secretion assay, and a Xenopus laevis cement gland differentiation 
assay; and/or 

(c) the mutation of the human AGR2 protein according to SEQ ID 
NO:4 which is indicative of an increased risk of a human subject of 
developing a medical condition associated with an alteration in 
goblet cell function, or indicative of an association of a medical 
condition in a human subject which is associated with an alteration 
in goblet cell function with altered AGR2 expression or function; 
and/or 

(d) Glu 137 in SEQ ID NO:2 and SEQ ID NO:30. 

An aptamer specifically binding an epitope in a protein which corresponds 
to 

(a) the mouse Agr2 or the human AGR2 protein according to SEQ ID 
NO:3 and SEQ ID NO:4, respectively, or an orthologue thereof 
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having at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 
99% amino acid identity compared to the mouse Agr2 or the 
human AGR2 protein according to SEQ ID NO:3 and SEQ ID 
NO:4, respectively; or 
(b) a protein which affects expression or function of the mouse Agr2 
or the human AGR2 protein according to SEQ ID NO:3 and SEQ 
ID NO:4, respectively, or an orthologue thereof having at least 
65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99% amino acid 
identity compared to the mouse Agr2 or the human AGR2 protein 
according to SEQ ID NO:3 and SEQ ED NO:4, respectively. 

A non-human vertebrate animal comprising in the genome of at least some 
of its cells an allele of a gene encoding a protein having at least 65%, 70%, 
75%, 80%, 85%, 90%, 95%, 98%, or 99% amino acid identity compared to 
the mouse Agr2 or the human AGR2 protein according to SEQ ID NO:3 
and SEQ ID NO:4, respectively, said allele comprising a mutation which, 

a) if present in the genome of all or essentially all cells of said animal 
in a homozygous manner, results in a phenotype associated with an 
alteration in goblet cell function compared to the corresponding 
wild-type animal; and/or 

b) corresponds to a mutation in the mouse Agr2 protein or the human 
AGR2 protein as defined above which leads to an altered biological 
activity of the mutated protein when compared to the corresponding 
wild-type mouse Agr2 protein or human AGR2 protein in an in 
vitro assay selected from the group consisting of a colon cell 
proliferation assay, a goblet cell mucus secretion assay, and a 
Xenopus laevis cement gland differentiation assay; and/or 

c) corresponds to a mutation of the human AGR2 protein as defined 
above which is indicative of an increased risk of a human subject of 
developing a medical condition associated with an alteration in 
goblet cell function, or indicative of an association of a medical 
condition in a human subject which is associated with an alteration 
in goblet cell function with altered AGR2 expression or function. 
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73. A non-human vertebrate animal comprising in the genome of at least some 
of its cells an allele of a gene coding for a protein which affects expression 
or function of the AGR2 protein of said animal, said allele comprising a 
5 mutation which, if present in the genome of all or essentially all cells of 

said animal in a homozygous manner, results in a phenotype associated 
with an alteration in goblet cell function compared to the corresponding 
wild-type animal. 



10 74, The animal according to claims 72 or 73, wherein said alteration results in 
a loss of function phenotype. 

75. The animal according to claims 72 or 73, wherein said alteration results in 
a gam of function phenotype. 

15 

76. The animal according to any one of claims 72 to 75, wherein said 
alteration is an alteration in goblet cell differentiation, particularly terminal 
differentiation, and/or goblet cell mucus production or secretion and/or 
mucus composition. 

20 

77. The animal according to any one of claims 72 to 74, or 76, wherein said 
alteration is characterized by a reduction in pre-mucin storing granules in 
the goblet cells, an altered mucus secretion, and secondary inflammatory 
infiltrations in the intestinal mucosal epithelium and submucosa. 

25 

78. The animal according to any one of claims 72 to 74 or 76 to 77, wherein 
said phenotype is furthermore associated with an increased proliferation of 
the glandular epithelium of the Brunner's gland. 

30 79. The animal according to any one of claims 72 to 74, or 76 to 78, wherein 
said alteration results in diarrhea, or diarrhea and a thriving deficit. 
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80. The animal according to any one of claims 72 to 79, wherein said gene is 
an endogenous gene with respect to said animal. 

81. The animal according to any one of claims 72 or 74 to 80, wherein said 
5 gene encodes a protein which is an orthologue of SEQ ID NO:3 and SEQ 

ID NO:4 with respect to said animal. 

82. The animal according to any one of claims 72 to 79, wherein said gene is a 
heterologous gene with respect to said animal. 

10 

83. The animal according to any one of claims 72 or 74 to 82 wherein said 
gene encodes a protein according to any one of claims 1 to 18. 

84. The animal according to claim 83 wherein said gene encodes a protein 
15 having the amino acid sequence of SEQ ID NO:2 or SEQ ID NO:30. 

85. The animal according to any one of claims 72 to 84, wherein said animal is 
a transgenic animal. 

20 86. The animal according to any one of claims 72 to 85, wherein said mutation 
results in the reduction or complete abolishment of expression of said 
gene. 

87. The animal according to any one of claims 72 to 86, wherein said allele is 
25 expressed under the control of a promoter other than the endogenous 

promoter of said gene. 

88. The animal according to claim 87, wherein said promoter has a tissue 
specificity other than that of the endogenous promoter of said gene. 



30 



89. The animal according to claims 87 or 88, wherein said promoter is an 
inducible promoter. 
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90. The animal according to any one of claims 72 to 89, wherein said cells are 
the germ cells of said animal. 

91. The animal according to any one of claims 72 to 89, wherein said cells are 
5 the somatic cells of said animal. 

92. The animal according to claims 90 or 91, wherein the genome of all or 
essentially all of the germ cells and the somatic cells of said animal 
comprise said allele. 

10 

93. The animal according to any one of claims 72 to 92, wherein said genome 
of said cells is homozygous in respect of said allele. 

94. The animal according to any one of claims 72 to 93, wherein said animal is 
1 5 a mammalian animal, preferably a rodent. 

95. The animal according to claim 94, wherein said animal is selected from the 
group consisting of a mouse, rat, rabbit, hamster, dog, cat, sheep, and 
horse. 

20 

96. Use of the non-human vertebrate animal according to any one of claims 72 
to 95 for the identification of a protein or nucleic acid diagnostic marker 
for a goblet cell-related disorder, or as an animal model for studying the 
molecular mechanisms of, or physiological processes associated with, a 

25 goblet cell-related disorder, or for the identification and testing of an agent 

useful in the prevention, amelioration, or treatment of a goblet cell-related 
disorder. 

97. The use according to claim 96, wherein said goblet cell-related disorder is 
30 selected from the group consisting of asthma, chronic obstructive 

pulmonary disease (COPD), cystic fibrosis, dry eye syndrome, gastric 
disease, peptic ulcer, inflammatory bowel disease, in particular Crohn's 
disease or ulcerative colitis, and intestinal cancer. 
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98. Use of the non-human vertebrate animal according to any one of claims 72 
to 95 for studying the molecular mechanisms of, or physiological 
processes associated with, conditions associated with, or affected by, 
reduced activity or undesirable, e.g. increased, activity of endogenous 
AGR2; reduced expression, reduced production or undesirable, e.g. 
increased, production of endogenous AGR2; or for the identification and 
testing of an agent useful in the prevention, amelioration, or treatment of 
these conditions. 

99. The uses according to any one of claims 96 to 98, wherein said agent is 
selected from the group consisting of a small molecule drug, a 
(polypeptide, and a nucleic acid. 



15 100. The use of claim 99, wherein said agent is an antagonist of AGR2. 



10 



101 . The use of claim 99, wherein said agent is an agonist of AGR2. 



1 02. Use of the non-human vertebrate animal according to any one of claims 72 
20 to 95 for studying or identifying protein or nucleic acid diagnostic markers, 

such as an early gene diagnostic marker, for diseases associated with 
AGR2 deficiency or over-expression. 

103. Use of the non-human vertebrate animal according to any one of claims 72 
25 to 95 for identifying receptors of the AGR2 protein, or genes or proteins 

regulated by AGR2 activity and deregulated in disorders associated with 
AGR2 deficiency or overexpression. 



104. A method of identifying a protein or nucleic acid marker indicative of an 
30 increased risk of a human subject of developing a medical condition 

associated with an alteration in goblet cell function, said method 
comprising the step of analyzing a test sample derived from a human 
subject for the presence of a difference compared to a similar test sample if 
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derived from a human subject unaffected by or known not to be at risk of 
developing said condition, wherein said difference is indicative of the 
presence of a mutation in an allele of the gene coding for the AGR2 
protein according to SEQ ID NO:4, or in an allele of a gene coding for a 
5 protein which affects expression or function of said AGR2 protein. 

105. A method of identifying a protein or nucleic acid marker indicative of an 
association of a medical condition in a human subject which is associated 
with an alteration in goblet cell function with altered AGR2 expression or 

10 function, said method comprising the step of analyzing a test sample 

derived from a human subject for the presence of a difference compared to 
a similar test sample if derived from a human subject unaffected by or 
known not to be at risk of developing said condition, wherein said 
difference is indicative of the presence of a mutation in an allele of the 

15 gene coding for the AGR2 protein according to SEQ ID NO:4, or in an 

allele of a gene coding for a protein which affects expression or function of 
said AGR2 protein. 

106. The method of claims 104 or 105, wherein said test sample is analyzed for 
20 a difference compared to similar test samples if derived from a group of 

human subjects unaffected by, or known not to be at risk of developing, 
said condition. 

107. The method according to any one of claims 104 to 106, wherein said 
25 human subject whose test sample is analyzed has a condition or is known 

or suspected to be at risk of developing a condition associated with an 
alteration in goblet cell function. 



108. The method of any one of claims 1 04 to 1 07, further comprising the step of 
30 obtaining said similar test sample or said similar test samples from said 

human subject or group of human subjects unaffected by, or known not to 
be at risk of developing, said condition. 
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109. The method according to any one of claims 104 to 108, wherein said 
alteration results in a loss of function phenotype. 

110. The method according to any one of claims 104 to 108, wherein said 
5 alteration results in a gain of function phenotype. 

111. The method according to any one of claims 104 to 110, wherein said 
alteration is an alteration in goblet cell differentiation, particularly terminal 
differentiation, and/or goblet cell mucus production or secretion and/or 

10 mucus composition. 

112. The method according to any one of claims 104 to 109 or 111, wherein 
said alteration is characterized by a reduction in pre-mucin storing granules 
in the goblet cells, an altered mucus secretion, and secondary inflammatory 

15 infiltrations in the intestinal mucosal epithelium and submucosa. 

113. The method according to any one of claims 104 to 109 or 111 to 112, 
wherein said medical condition is furthermore associated with an increased 
proliferation of the glandular epithelium of the Brunner's gland. 



20 



114. The method according to any one of claims 104 to 109 or 111 to 113, 
wherein said alteration results in diarrhea. 



115. The method according to any one of claims 104 to 110, wherein said 
25 medical condition is selected from the group consisting of asthma, chronic 

obstructive pulmonary disease (COPD), cystic fibrosis, dry eye syndrome, 
gastric disease, peptic ulcer, inflammatory bowel disease, in particular 
Crohn's disease or ulcerative colitis, and intestinal cancer. 

30 116. The method according to any one of claims 104 to 115, wherein said test 
sample is a nucleic acid sample. 
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117. The method according to claim 116, wherein said nucleic acid is selected 
from the group consisting of mRNA and genomic DNA. 

118. The method according to claims 1 16 or 1 17, wherein the step of analyzing 
5 said nucleic acid sample comprises amplifying at least a portion of its 

nucleic acid via the polymerase chain reaction, and optionally also 
amplifying via the polymerase chain reaction at least a portion of the 
nucleic acid of said similar sample or said similar samples. 

10 1 19. The method according to any one of claims 104 to 115, wherein said test 
sample is. a protein sample. 

120. The method according to claim 119, wherein said protein is the AGR2 
protein. 

15 

121. The method according to claims 1 16 to 120, wherein said difference is one 
in the expression level of said nucleic acid or said protein. 

122. The method according to claims 1 16 to 120, wherein said difference is one 
20 in the nucleotide or amino acid sequence of said nucleic acid or said 

protein. 

123. The method according to claim 122, wherein the step of analyzing 
comprises the partial or complete determination of the sequence of the 

25 nucleic acid or the amplified portion of the nucleic acid of said sample, and 

optionally also of the nucleic acid or the amplified portion of the nucleic 
acid of said similar sample or said similar samples. 

124. The method according to any one of claims 104 to 123, wherein said allele 
30 is comprised in the genome of the germ cells of said human subject. 



125. 



The method according to any one of claims 104 to 123, wherein said allele 
is comprised in the genome of the somatic cells of said human subject. 
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126. The method according to claims 124 or 125, further comprising the step of 
determining whether the genome of said germ cells or said somatic cells is 
homozygous with respect to said mutation in said allele. 

5 

127. The method according to any one of claims 104 to 126, wherein said 
mutation results in a deletion or substitution by another amino acid of an 
amino acid of the AGR2 protein encoded by said allele, or an insertion of 
additional amino acids not normally present in the amino acid sequence of 

10 the AGR2 protein according to SEQ ID NO:4. 



128. The method according to claim 127, wherein said deletion, substitution, or 
insertion occurs in an evolutionary conserved region of said AGR2 protein. 



15 129. The method according to claims 127 or 128, wherein said mutation results 
in the substitution of an amino acid which is identical or similar between 
mouse, rat, and human AGR2, preferably between mouse, rat, human, and 
Xenopus laevis AGR2, more preferably between mouse, rat, human, 
Xenopus laevis, and Caenorhabditis elegans AGR2, by another amino 

20 acid. 

130. The method according to any one of claims 127 to 129, wherein the 
substitution of said amino acid of the AGR2 protein by another amino acid 
is a non-conservative substitution. 

25 

131. The method according to any one of claims 127 to 130, wherein said 
amino acid of the AGR2 protein that is deleted or substituted is Val 137. 

132. The method according to claim 131, wherein the substitution at position 
30 1 37 is one of the following substitutions: 

a) Val -> acidic amino acid such as Glu or Asp; 

b) Val -> basic amino acid, such as His, Arg or Lys; 

c) Val aliphatic hydroxyl side chain amino acid, such as Ser or Thr; 
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d) Val -> amide side chain amino acid, such as Asn or Gin; 

e) Val sulfur containing side chain amino acid, such as Cys or Met; 

f) Val -> aromatic side chain amino acid, such as Phe, Tyr, Trp; 

g) Val -> Gly or Pro; and 
5 h) Val -> Ala, Leu or lie. 

133. The method according to claim 132, wherein the substitution at position 
137 is a substitution of valine by glutamic acid. 

10 134. The method according to any one of claims 127 to 132, wherein said 
amino acid is substituted by a naturally occurring amino acid. 

135. A method for identifying a predisposition of a human subject for 
developing a medical condition associated with an alteration in goblet cell 

15 function, said method comprising the step of determining whether a test 

sample derived from said human subject indicates the presence of a 
mutation in an allele of the gene coding for the AGR2 protein according to 
SEQ ID NO:4 indicative of an increased risk of said human subject of 
developing said medical condition. 

20 

136. The method according to claim 135, further comprising the step of 
assigning a certain risk of developing said medical condition to said human 
subject. 

25 137. A method for determining whether a medical condition in a human subject 
which is associated with an alteration in goblet cell function is associated 
with altered AGR2 expression or function, said method comprising the 
step of determining whether a test sample derived from said human subject 
indicates the presence of a mutation in an allele of the gene coding for the 

30 AGR2 protein according to SEQ ID NO:4 indicative of an altered AGR2 

expression or function. 
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138. The method according to claim 137, further comprising the step of 
assigning an association with altered AGR2 expression or function to said 
human subject's medical condition. 

5 139. The method according to any one of claims 135 to 138, wherein said 
alteration results in a loss of function phenotype. 

140. The method according to any one of claims 135 to 138, wherein said 
alteration results in a gain of function phenotype. 

10 

141. The method according to any one of claims 135 to 140, wherein said 
alteration is an alteration in goblet cell differentiation, particularly terminal 
differentiation, and/or goblet cell mucus production or secretion and/or 
mucus composition. 

142. The method according to any one of claims 135 to 139 or 141, wherein 
said alteration is characterized by a reduction in pre-mucin storing granules 
in the goblet cells, an altered mucus secretion, and secondary inflammatory 
infiltrations in the intestinal mucosal epithelium and submucosa. 

143. The method according to any one of claims 135 to 139 or 141 to 142, 
wherein said medical condition is furthermore associated with an increased 
proliferation of the glandular epithelium of the Brunner's gland. 

25 144. The method according to any one of claims 135 to 139 or 141 to 143, 
wherein said alteration results in diarrhea. 



15 



20 



145. The method according to any one of claims 135 to 140, wherein said 
medical condition is selected from the group consisting of asthma, chronic 
obstructive pulmonary disease (COPD), cystic fibrosis, dry eye syndrome, 
gastric disease, peptic ulcer, inflammatory bowel disease, in particular 
Crohn's disease or ulcerative colitis, and intestinal cancer. 
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146. The method according to any one of claims 135 to 145 wherein said test 
sample is a nucleic acid sample. 

147. The method according to claim 146, wherein said nucleic acid is selected 
5 from the group consisting of mRNA and genomic DNA. 

148. The method according to any one of claims 135 to 145, wherein said test 
sample is a protein sample. 

10 149. The method according to claim 148, wherein said protein is the AGR2 
protein. 

150. The method according to any one of claims 135 to 149, wherein said allele 
is comprised in the genome of the germ cells of said human subject 

15 

151. The method according to any one of claims 135 to 149, wherein said allele 
is comprised in the genome of the somatic cells of said human subject 

152. The method according to claims 150 or 151, further comprising the step of 
20 determining whether the genome of said germ cells or said somatic cells is 

homozygous with respect to said mutation in said allele. 

153. The method according to any one of claims 135 to 152, wherein said 
mutation results in the reduction or complete abolishment of expression of 

25 the AGR2 protein. 

154. The method according to any one of claims 135 to 153, wherein said 
mutation results in a deletion or substitution by another amino acid of an 
amino acid of the AGR2 protein encoded by said allele, or an insertion of 

30 additional amino acids not normally present in the amino acid sequence of 

the AGR2 protein according to SEQ ID NO:4. 
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155. The method according to claim 154, wherein said deletion, substitution, or 
insertion occurs in an evolutionary conserved region of said AGR2 protein. 

156. The method according to claims 154 or 155, wherein said mutation results 
5 in the substitution of an amino acid which is identical or similar between 

mouse, rat, and human AGR2, preferably between mouse, rat, human, and 
Xenopus laevis AGR2, more preferably between mouse, rat, human, 
Xenopus laevis, and Caenorhabditis elegans AGR2, by another amino 
acid. 

10 

157. The method according to any one of claims 154 to 156, wherein the 
substitution of said amino acid of the AGR2 protein by another amino acid 
is a non-conservative substitution. 



15 158. The method according to any one of claims 154 to 157, wherein said 
amino acid of the AGR2 protein that is deleted or substituted is Val 137. 

159. The method according to claim 158, wherein the substitution at position 
137 is one of the following substitutions: 
20 a) Val acidic amino acid such as Glu or Asp; 

b) Val -> basic amino acid, such as His, Arg or Lys; 

c) Val aliphatic hydroxyl side chain amino acid, such as Ser or 
Thr; 

d) Val -> amide side chain amino acid, such as Asn or Gin; 

25 e) Val sulfur containing side chain amino acid, such as Cys or Met; 

f) Val -> aromatic side chain amino acid, such as Phe, Tyr, Trp; 

g) Val -> Gly or Pro; and 

h) Val -» Ala, Leu or He. 



30 160. The method according to claim 159, wherein the substitution at position 
137 is a substitution of valine by glutamic acid. 
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161. The method according to any one of claims 154 to 159, wherein said 
amino acid is substituted by a naturally occurring amino acid. 

162. The method according to any one of claims 135 to 152 and 154 to 161, 
5 wherein said gene codes for a AGR2 protein having the sequence set forth 

inSEQIDNO:30. 



163. A pharmaceutical composition comprising a protein or protein fragment 
according to any one of claims 1 to 21, a nucleic acid according to claim 
22 or 23, an episomal element or vector according to any one of claims 24, 
25, and 29 to 32, an antisense nucleic acid according to any one of claims 
36 to 41, an siRNA according to any one of claims 45 to 50, an antibody 
according to any one of claims 54 to 62, an immunoconjugate according to 
any one of claims 63 to 67, an anticalin according to claims 68 or 69, or an 
aptamer according to claims 70 or 71, and a pharmaceutical^ acceptable 
carrier. 



1 64. The protein or protein fragment according to any one of claims 1 to 21 , the 
nucleic acid according to claim 22 or 23, the episomal element or vector 

20 according to any one of claims 24, 25, and 29 to 32, the antisense nucleic 

acid according to any one of claims 36 to 41, the siRNA according to any 
one of claims 45 to 50, the antibody according to any one of claims 54 to 
62, the immunoconjugate according to any one of claims 63 to 67, the 
anticalin according to claims 68 or 69, or the aptamer according to claims 

25 70 or 71 for use as a medicament. 



165. The protein or protein fragment, nucleic acid, episomal element, vector, 
antisense nucleic acid, siRNA, antibody, aptamer, anticalin, . or 
immunoconjugate according to claim 164 for the use specified therein, 
wherein the medicament is for treating a medical condition in a human 
subject which is associated with an alteration in goblet cell function, said 
medical condition optionally being furthermore associated with an 
increased proliferation of the glandular epithelium of the Brunner's gland. 
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166. The protein or protein fragment, nucleic acid, episomal element, vector, 
antisense nucleic acid, siRNA, antibody, aptamer, anticalin, or 
immunoconjugate according to claim 165 for the use specified therein, 

5 wherein said alteration is an alteration in goblet cell differentiation and/or 

goblet cell mucus production or secretion and/or mucus composition. 

167. The protein or protein fragment, nucleic acid, episomal element, vector, 
antisense nucleic acid, siRNA, antibody, aptamer, anticalin, or 
immunoconjugate according to claim 165 or 166 for the use specified 
therein, wherein said alteration is characterized by a reduction in pre- 
mucin storing granules in the goblet cells, an altered mucus secretion, and 
secondary inflammatory infiltrations in the intestinal mucosal epithelium 
and submucosa. 

168. The protein or protein fragment, nucleic acid, episomal element, vector, 
antisense nucleic acid, siRNA, antibody, aptamer, anticalin, or 
immunoconjugate according to any one of claims 165 to 167, wherein said 
alteration results in diarrhea 

169. The protein or protein fragment, nucleic acid, episomal element, vector, 
antisense nucleic acid, siRNA, antibody, aptamer, anticalin, or 
immunoconjugate according to claims 164 or 165 for the use specified 
therein, wherein said medicament is for treating a condition selected from 
the group consisting of asthma, chronic obstructive pulmonary disease 
(COPD), cystic fibrosis, dry eye syndrome, gastric disease, peptic ulcer, 
inflammatory bowel disease, in particular Crohn's disease or ulcerative 
colitis, and intestinal cancer. 

170. A method of producing a mutant AGR2 protein comprising culturing a 
host cell according to any one of claims 33 to 35 in a suitable medium 
under conditions such that the protein is expressed, and harvesting the cells 
or the medium. 
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171. The method according to claim 170 > wherein the protein is subsequently 
further purified from said cells or said medium. 

172. A method of gene therapy comprising delivering to cells in a human 
subject suffering from or known to be at risk of developing a condition 
associated with an alteration in goblet cell function a DNA construct 
comprising 

(a) a sequence of an allele of the AGR2 gene encoding the human 
AGR2 protein according to SEQ ID NO:4, or encoding a protein 
having at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 
99% amino acid identity compared to the mouse Agr2 or the human 
AGR2 protein according to SEQ ID NO:3 and SEQ ID NO:4, 
respectively; or a sequence of an allele of the AGR2 gene of a 
human subject unaffected by or known not to be at risk of 
developing said condition; 

(b) a DNA sequence encoding the human AGR2 protein according to 
SEQ ID NO:4, or a human AGR2 protein encoded by the AGR2 
gene of a human subject unaffected by or known not to be at risk of 
developing said condition, or a protein having at least 65%, 70%, 
75%, 80%, 85%, 90%, 95%, 98%, or 99% amino acid identity 
compared to the mouse Agr2 or the human AGR2 protein 
according to SEQ ID NO:3 and SEQ IDNO:4, respectively; 

(c) a DNA sequence encoding an antisense nucleic acid according to 
any one of claims 36 to 41, or an antisense nucleic acid comprising 
a nucleotide sequence which is complementary to an mRNA 
encoded by the AGR2 gene of a human subject unaffected by or 
known not to be at risk of developing said condition; 

(d) a DNA sequence encoding an siRNA according to any one of 
claims 45 to 50; 

(e) a DNA sequence encoding an aptamer according to any one of 
claims 70 to 71; or 
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(f) a DNA sequence encoding an AGR2 protein according to any one 
of claims 1 to 21. 

173. The method of claim 172, wherein said human AGR2 gene of a subject 
5 unaffected by or known not to be at risk of developing said condition is a 

gene encoding a human AGR2 protein according to SEQ ID NO:4. 

174. The method of claims 172 or 173, wherein said cells are intestinal cells of 
said human subject, preferably goblet cells. 

10 

175. The method of claims 172 or 173, wherein said cells are gastrointestinal 
cells of said human subject, preferably goblet cells and/or mucus secreting 
cells of the Brunner's gland. 

15 176. The method of claims 172 or 173, wherein said cells are cells of the 
respiratory tract of said human subject, preferably goblet cells and/or 
mucus secreting cells of the submucosal glands of the trachea 

177. The method of any one of claims 172 to 176, wherein the DNA construct 
20 is a viral vector. 

178. The method of any one of claims 172 to 177, wherein said DNA construct 
is capable of directing expression of said protein, said antisense nucleic 
acid, or said siRNA. 

25 

179. The method of claim 178, wherein said expression is transient. 

180. The method of any one of claims 172 to 178, wherein the DNA construct 
is capable of being stably integrated into the genome of said cells. 

30 

181. The method of any one of claims 172 to 180, wherein said sequence of an 
allele of the AGR2 gene comprises coding sequences of said gene. 
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1 82. The method of any one of claims 1 72 to 181, wherein said sequence of an 
allele of the AGR2 gene comprises non-coding sequences of said gene, 

1 83 . Use of a DNA construct comprising 

5 (a) a sequence of an allele of the AGR2 gene encoding the human 

AGR2 protein according to SEQ ID NO:4, or encoding a protein 
having at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 
99% amino acid identity compared to the mouse Agr2 or the human 
AGR2 protein according to SEQ ID NO:3 and SEQ ID NO:4, 
10 respectively; or a sequence of an allele of the AGR2 gene of a 

human subject unaffected by or known not to be at risk of 
developing said condition; 
(b) a DNA sequence encoding the human AGR2 protein according to 
SEQ ID NO:4, or a human AGR2 protein encoded by the AGR2 
15 gene of a human subject unaffected by or known not to be at risk of 

developing said condition, or a protein having at least 65%, 70%, 
75%, 80%, 85%, 90%, 95%, 98%, or 99% amino acid identity 
compared to the mouse Agr2 or the human AGR2 protein 
according to SEQ ID NO:3 and SEQ ID NO:4, respectively; 
20 (c) a DNA sequence encoding an antisense nucleic acid according to 

any one of claims 36 to 41, or an antisense nucleic acid comprising 
a nucleotide sequence which is complementary to an mRNA 
encoded by the AGR2 gene of a human subject unaffected by or 
known not to be at risk of developing said condition; 
25 (d) a DNA sequence encoding an siRNA according to any one of 

claims 45 to 50; 

(e) a DNA sequence encoding an aptamer according to any one of 
claims 70 to 71; or 

(f) a DNA sequence encoding an AGR2 protein according to any one 
30 of claims 1 to 21 

for the preparation of a pharmaceutical for the treatment of a condition 
associated with an alteration in goblet cell function, said condition 
optionally being furthermore associated with an increased proliferation of 
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the glandular epithelium of the Brunner's gland, or the prevention of said 
condition in a human subject known to be at risk of developing such 
condition. 

184. A method of preventing, treating, or ameliorating a medical condition in a 
human subject associated with an alteration in goblet cell function, said 
method comprising administering to said human subject a pharmaceutical 
composition comprising an agent capable of modulating AGR2 activity in 
said human subject. 

185. The method of claim 184, wherein said pharmaceutical composition is a 
pharmaceutical composition according to claim 163. 

186. The method according to claim 184, wherein said agent capable of 
modulating AGR2 activity in said human subject is 

(a) an isolated protein having the sequence of the human AGR2 protein 
according to SEQ ID NO:4, 

(b) an isolated protein having at least 65%, 70%, 75%, 80%, 85%, 
90%, 95%, 98%, or 99% amino acid identity compared to the 
mouse Agr2 or the human AGR2 protein according to SEQ ID 
NO:3 and SEQ ID NO:4, respectively, wherein said protein shows 
the same or essentially the same activity as the human AGR2 
protein according to SEQ ID NO:4 in an in vitro assay selected 
from the group consisting of a colon cell proliferation assay, a 
goblet cell mucus secretion assay, and a Xenopits laevis cement 
gland differentiation assay; 

(c) an isolated fragment of the protein according to (a) or (b) above 
comprising at least 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 50, 60, 70, 
80, 90, 100, 110, 120, 130, 140, 150, 160, 165, 170, 171, 172, 173, 
or 174 contiguous amino acids having said percentages of amino 
acid identity compared to the corresponding amino acids in SEQ ID 
NO:3 and SEQ ID NO:4, wherein said fragment shows the same or 
essentially the same activity as the human AGR2 protein according 
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to SEQ ID NO:4 in an in vitro assay selected from the group 
consisting of a colon cell proliferation assay, a goblet cell mucus 
secretion assay, and aXenopus laevis cement gland differentiation 
assay; 

5 (d) a fusion protein comprising a protein or protein fragment according 

to (a) to (c) above fused to another protein or protein fragment not 
having said percentages of amino acid sequence identity to any 
corresponding amino acids in SEQ ID NO:3 and SEQ ID NO:4; 
preferably fused to a protein unrelated to the mouse Agr2 or the 

10 human AGR2 protein according to SEQ ID NO:3 and SEQ ID 

NO:4, respectively; 

(e) an antibody specifically recognizing an epitope comprised within 
the human AGR2 protein according to SEQ ID NO:4, or within a 
human AGR2 protein encoded by the AGR2 gene of a human 

15 subject unaffected by or known not to be at risk of developing a 

medical condition associated with altered goblet cell function; or 

(f) an antisense nucleic acid comprising a nucleotide sequence which 
is complementary to an mRNA encoded by the AGR2 gene of a 
human subject unaffected by or known not to be at risk of 

20 developing said condition, preferably encoded by the AGR2 gene 

encoding the human AGR2 protein according to SEQ ID NO:4. 

187. Use of an agent capable of modulating AGR2 activity for the preparation 
of a pharmaceutical for preventing, treating, or ameliorating a medical 
25 condition in a human subject associated with an alteration in goblet cell 

function, said condition optionally being furthermore associated with an 
increased proliferation of the glandular epithelium of the Brunner's gland. 



188. The use according to claim 187, wherein said agent is selected from the 
30 group consisting of a protein or protein fragment according to any one of 

claims 1 to 21, a nucleic acid according to claim 22 or 23, an episomal 
element or vector according to any one of claims 24, 25, and 29 to 32, an 
antisense nucleic acid according to any one of claims 36 to 41, an siRNA 
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according to any one of claims 45 to 50, an antibody according to any one 
of claims 54 to 62, an immunoconjugate according to any one of claims 63 
to 67, an anticalin according to claims 68 or 69, and an aptamer according 
to claims 70 or 71. 

The use according to claim 187, wherein said agent capable of modulating 
AGR2 activity in said human subject is 

(a) an isolated protein having the sequence of the human AGR2 protein 
according to SEQ ID NO:4, 

(b) an isolated protein having at least 65%, 70%, 75%, 80%, 85%, 
90%, 95%, 98%, or 99% amino acid identity compared to the 
mouse Agr2 or the human AGR2 protein according to SEQ ID 
NO:3 and SEQ ID NO:4, respectively, wherein said protein shows 
the same or essentially the same activity as the human AGR2 
protein according to SEQ ID NO:4 in an in vitro assay selected 
from the group consisting of a colon cell proliferation assay, a 
goblet cell mucus secretion assay, and a Xenopus laevis cement 
gland differentiation assay; 

(c) an isolated fragment of the protein according to (a) or (b) above 
comprising at least 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 50, 60, 70, 
80, 90, 100, 110, 120, 130, 140, 150, 160, 165, 170, 171, 172, 173, 
or 174 contiguous amino acids having said percentages of amino 
acid identity compared to the corresponding amino acids in SEQ ID 
NO:3 and SEQ ID NO:4, wherein said fragment shows the same or 
essentially the same activity as the human AGR2 protein according 
to SEQ ID NO:4 in an in vitro assay selected from the group 
consisting of a colon cell proliferation assay, a goblet cell mucus 
secretion assay, and a Xenopus laevis cement gland differentiation 
assay; 

(d) a fusion protein comprising a protein or protein fragment according 
to (a) to (c) above fused to another protein or protein fragment not 
having said percentages of amino acid sequence identity to any 
corresponding amino acids in SEQ ID NO:3 and SEQ ID NO:4; 
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preferably fused to a protein unrelated to the mouse Agr2 or the 
human AGR2 protein according to SEQ ID NO:3 and SEQ ID 
NO:4, respectively; 

(e) an antibody specifically recognizing an epitope comprised within 
5 the human AGR2 protein according to SEQ ID NO:4, or within a 

human AGR2 protein encoded by the AGR2 gene of a human 
subject unaffected by or known not to be at risk of developing a 
medical condition associated with altered goblet cell function; or 

(f) an antisense nucleic acid comprising a nucleotide sequence which 
10 is complementary to an mRNA encoded by the AGR2 gene of a 

human subject unaffected by or known not to be at risk of 
developing said condition, preferably encoded by the AGR2 gene 
encoding the human AGR2 protein according to SEQ ID NO:4. 



15 190. Use of a wild type AGR2 protein, e.g., an AGR2 wild type protein 
according to SEQ ID NO:4, a nucleic acid encoding such a protein, e.g., a 
nucleic acid having the nucleotide sequence set forth in SEQ ID NO: 5, or 
a small molecule agonist of AGR2 for preventing, treating, or ameliorating 
a medical condition in a human subject associated with an alteration in 
goblet cell function, where the medical condition is selected from the 
group consisting of dry eye syndrome, gastric disease, peptic ulcer 
inflammatory bowel disease, in particular Crohn's disease or ulcerative 
colitis, and intestinal cancer. 

191. Use of a small molecule antagonist of AGR2 for preventing, treating, or 
ameliorating a medical condition in a human subject associated with an 
alteration in goblet cell function, where the medical condition is selected 
from the group consisting of asthma, chronic obstructive pulmonary 
disease (COPD), and cystic fibrosis. 



192. A method of identifying an agent useful in the prevention, amelioration, or 
treatment of a goblet cell-related disorder, the method comprising 
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a) cultuiing mammalian goblet cells in the presence or absence of a 
candidate agent; and 

b) determining whether the presence of the agent results in an increase 
in the production by the cells of mucus and/or one or more 
particular mucus constituents; 

wherein said goblet cells show a reduced or no expression of the AGR2 
protein, or cany a mutation in one or both alleles of their endogenous 
AGR2 gene so that the allele is no longer capable of being expressed, or 
that it encodes a protein according to any one of claims 1 to 1 8. 

A method of identifying an agent useful in the prevention, amelioration, or 
treatment of a goblet cell-related disorder, the method comprising 

a) culturing mammalian goblet cells in the presence or absence of a 
candidate agent; and 

b) determining whether the presence of the agent results in a decrease 
in the production by the cells of mucus and/or one or more 
particular mucus constituents; 

wherein said goblet cells show an increased expression of the AGR2 
protein, or carry a mutation in one or both alleles of their endogenous 
AGR2 gene so that the allele shows an increased amount of expression or 
that it encodes a protein according to any one of claims 1 to 18. 

A method of identifying an antagonist of the AGR2 protein, the method 
comprising 

a) culturing mammalian goblet cells in the presence or absence of a 
wild-type mammalian AGR2 protein, preferably the mouse Agr2 
protein or the human AGR2 protein according to SEQ ID NO:3 and 
SEQ ID NO:4, respectively; and 

b) determining whether an increase in the production by the cells of 
mucus and/or one or more particular mucus constituents in the 
presence of said wild-type AGR2 protein is observed upon the 
addition of a candidate antagonist agent to the cultured cells. 
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195. A method of identifying an antagonist of the AGR2 protein, the method 
comprising 

a) cuituring mammalian goblet cells in the presence or absence of a 
wild-type mammalian AGR2 protein, preferably the mouse Agr2 

5 protein or the human AGR2 protein according to SEQ ID NO:3 and 

SEQ ID NO:4, respectively; and 

b) determining whether a decrease in the production of mucus and/or 
one or more particular mucus constituents by the cells which are 
cultured in the presence of the wild-type mammalian AGR2 protein 

10 is observed upon the addition of a candidate antagonist agent to the 

cultured cells. 

196. The method according to claims 194 or 195, wherein said goblet cells 
show a reduced or no expression of the AGR2 protein, or carry a mutation 

15 in one or both alleles of their endogenous AGR2 gene so that the allele is 

no longer capable of being expressed or that it encodes a protein according 
to any one of claims 1 to 18. 

197. The method according to claim 192, 193 or 196, wherein said cells are 
20 homozygous for said mutated endogenous AGR2 allele. 

198. The method according to any one of claims 192, 196 or 197, wherein said 
cells do not additionally contain a functional allele of a wild type AGR2 
gene (i.e., no functional allele of the corresponding wild type orthologue, 

25 or of aJieterplogous_Md-j^pg AGR2 gene), jor ^ajaucMc^j^Ls^^once, 

expressing a wild type AGR2 protein (representing either the 
corresponding wild type orthologue, or a heterologous wild type AGR2 
protein). 

30 199. The method according to any one of claims 192, 193 and 196 to 198, 
wherein said protein encoded by said mutated allele has the amino acid 
sequence set forth in SEQ ID NO:2 or SEQ ID NO:30. 
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200. The method according to any one of claims 192 to 199, wherein the mucus 
constituent is mucin2 or a trefoil peptide. 



201. The method according to claim 200, wherein it is determined whether the 
5 presence of the agent results in an increase in the production of both 

mucin2 and one or more of the trefoil peptides, or whether an increase in 
the production of both mucin2 and one or more of the trefoil peptides in 
the presence of said wild-type AGR2 protein is observed upon the addition 
of said candidate antagonist agent 

10 

202. The method according to claim 200, wherein it is determined whether the 
presence of the agent results in a decrease in the production of both 
mucin2 and one or more of the trefoil peptides, or whether a decrease in 
the production of both mucin2 and one or more of the trefoil peptides in 

15 the presence of said wild-type AGR2 protein is observed upon the addition 

of said candidate antagonist agent. 



203. The method according to claim 200 or 202, wherein the expression of 
mucin2 and/or the trefoil peptide(s) is determined via quantitative PCT 
analysis using muc2- and trefoil peptide-specific primers. 

204. The method according to any one of claims 192 to 203, wherein said 
mammalian goblet cells are LS174T or HT29 cells. 



205. The method according to any one of claims 192 to 204, wherein the 
candidate agent is selected from the group consisting of 

a) a peptide or polypeptide; 

b) a nucleic acid (including a peptide nucleic acid); and 

c) a small molecule having a molecular weight of no more than 2000 
Dalton, preferably no more than 1500 Dalton, more preferably no 
more than 1000 Dalton, and most preferably no more than 500, 
400, 300, or even 200 Dalton. 
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206. An agent identified or identifiable by a method according to any one of 
claims 192 to 205. 

5 207. Use of an agent according to claim 206 for the prevention, amelioration, or 
treatment of a medical condition associated with an alteration in goblet cell 
function. 
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Figure 1A. 
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Mouse Agr 2 (Chr.12: 30,11 Mb) 
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Figure 2. 



BLAST 2 Sequences 
( http : //www . ncbi . nlm . nih , gov /blast /bl2seq/bl2 . html ) 



Sequence 1: mouse AGR2 (WT) ; SEQ ID No:3 
Sequence 2: human AGR2 (WT) ; SEQ ID No: 4 

NOTE: The statistics (bitscore and expect value) is calculated based on the 
size of nr database* 

Score = 323 bits (828), Expect = 4e-88- 

Identities - 160/175 (91%), Positives - 168/175 (95%) 
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Query: 1 MEKFSVSAILLLVAISGTLAKDTTVKSGAKKDPKDSRPKLPQTLSRGWGDQLIWTQTYEE 60 

MEK VSA LLLVA+S TLA+DTTVK GAKKD KDSRPKLPQTLSRGWGDQLIWTQTYEE 
Sbjct: 1 MEKIPVSAFLLLVALSYTLARDTTVKPGAIQCDTKDSRPKLPQTLSRGWGDQLIWTQTYEE 60 

Query: 61 ALYRSKTSNRPLWIHHLDECPHSQALKKVFAEHKEIQKLAEQFVLLNLVYETTDKHLSP 120 

ALY+SKTSN+PLM+IHHLDECPHSQALKKVFAE+KEIQKLAEQFVLLNLVYETTDKHLSP 
Sbjct: 61 ALYKSKTSNKPLMIIHHLDECPHSQALKKVFAENKEIQKLAEQFVLIiNLVYETTDKHLSP 120 

Query: 121 DGQWPRIVFVDPSLT|RADITGRYSNRLYAYEPSDTALLYDNMKKALKLLKTEL 175 

dgqyvpri+fvdpslt|raditgrysnrlyayep+dtall DNMKKALKLLKTEL 

Sbjct: 121 DGQYVPRIMFVDPSLTj^jRADITGRYSNRLYAYEPADTALLLDNMKKALKLLKTEL 175 
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Figure 3. 



Breed ing-Scheeme: 
A) F3 production 
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Figure 4. 
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Haplotypes Scheme 
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Figure 7. 
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Figure 9. 
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Figure 10* 
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Figure 11. 

Colon (wild type mouse) 




Colon (affected mouse) 
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Figure 12. 
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Colon (affected mouse) 
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Figure 13. 
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Colon (wild type mouse) 





Colon (affected mouse) 
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Figure 14. 



Brunner's gland (wild type mouse) 




Brunner's gland (affected mouse) 
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Figure 15A. 
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Figure 15B. 
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Figure 16. 



Software used: 

- MultAlin via http://prodes.toulouse.inra.fr/multalin/multalin.htna [Corpet. F. (1988) , 

Multiple sequence alignment with hierarchical clustering, Nucl. Acids Res., 16 {22} , 
10881-10890] 

- BOXSHADE 3.21 via http://www.ch.embnet.org/software/BOX_form.html 



Mm: Mus musculus; NP_035913 
Hs: Homo sapiens; NP_006399 

Rn: Rattus norvegicus; derived by Genewise on AC126809 
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Figure 17. 

Software used: 

- MultAlin via http://prodes.toulouse.inra.fr/multalin/multalin.html [Corpet. F. (1988), 

10881 P 10890?^ enCe alignment With hierarchica l clustering, Nucl. Acids Res./ 16 (22), 

- BOXSHADE 3.21 via http://www.ch.embnet.org/software/BOX_form.html 



Mm: Mus musculus; NP_035913 

Hs: Homo sapiens; NP_006399 

Rn: Rattus norvegicus; derived by Genewise on AC126809 

XI: Xenopus laevis; AAL26844 
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Figure 18. 



Software used: 

- Mult Al in via http://prodes.toulouse.inra.fr/multalin/multalin.html [Corpet. F. (19B8), 

Multiple sequence alignment with hierarchical clustering, Nucl. Acids Res., 16 (22), 
10881-10890] 

- BOXSHADE 3.21 via http://www.ch.embnet.org/software/BOX_forra.html 



Mm: Mus mus cuius; NP_035913 

Hs: Homo sapiens; NP__006399 

Rn: Rattus norvegicus? derived by Genewise on AC126809 

XI: Xenopus laevis; AAL26844 

Ce: Caenorhabditis elegans; NP_496599 
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Percentage of identical and similar amino acids: 32% 
Percentage of identical amino acids: 46% 
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Figure 19. 
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Figure 20. 
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